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Abstract. The development of effective knowledge discovery techniques has become a very active research area 
in recent years due to the important impact it has had in several relevant application domains. One interesting task 
therein is that of singling out anomalous individuals from a given population, e.g., to detect rare events in time- 
series analysis settings, or to identify objects whose behavior is deviant w.r.t. a codified standard set of rules. Such 
exceptional individuals are usually referred to as outliers in the literature. 

In this paper, the concept of outlier is formally stated in the context of knowledge-based systems, by generalizing 
that originally proposed in [7] in the context of default theories. The chosen formal framework here is that of 
logic programming, wherein potential applications of techniques for outlier detection are thoroughly discussed. 
The proposed formalization is a novel one and helps to shed light on the nature of outliers occurring in logic bases. 
Also the exploitation of minimality criteria in outlier detection is illustrated. 

The computational complexity of outlier detection problems arising in this novel setting is also thoroughly investi- 
gated and accounted for in the paper. Finally, rewriting algorithms are proposed that transform any outlier detection 
problem into an equivalent inference problem under stable model semantics, thereby making outlier computation 
effective and realizable on top of any stable model solver. 



Keywords: outlier detection, logic programming, knowledge representation, nonmonotonic reasoning, com- 
putational complexity. 



1 Introduction 



1.1 Statement of the Problem 

The development of effective knowledge discovery techniques has become a very active research area in 
recent yeas due to the important impact it has had in several relevant application areas. Knowledge discovery 
comprises quite diverse tasks and associated methods. One interesting task therein is that of singling out 
anomalous individuals from a given population, e.g., to detect rare events in time-series analysis settings, or 
to identify objects whose behavior is deviant w.r.t. a codified standard set of "social" rules. Such exceptional 
individuals are usually referred to as outliers in the literature. 

Outlier detection has important applications in bioinformatics [1], fraud detection [33,34], and intru- 
sion detection [32,48], just to cite a few. As a consequence, several approaches have been aheady devel- 
oped to reahze outher detection, mainly by means of data mining techniques including clustering-based and 
proximity-based methods as well as domain density analysis (see, e.g., [11, 3, 16,55, 8, 12]). Usually, these 
approaches model the "normal" behavior of individuals by performing some statistical kind of computation 
on the given data set (various methods basically differ on the basis of the way such computation is carried 
out) and, then, single out those individuals whose behavior or characteristics "significantly" deviate from 
the normal ones. 

As a first, quite simple, example of outher detection, consider the following short story: Nino is a young 
soccer player from Southern Italy. He has black hair and brown eyes. Where Nino lives, all people have 
black hair and brown eyes. Nino has only one big passion, that is, playing soccer and he does not know 
much about the world outside his home town. One day the boss of a team from Finland happens to see 
Nino playing and likes him so much that he decides to propose a contract for Nino to play with his team in 
Finland. A good wage is proposed, so Nino accepts, being also amazed by the strange appearance of this 
guy coming from Finland. He has blonde hair and blue eyes, indeed: 'He must really be a strange man', 
Nino must have thought. The day then arrives for Nino to move to Finland. His new boss welcomes him at 
the airport in Helsinki and, together, they go to the football ground. In a large hall, the boss introduces Nino 
to his colleagues and to other people working for the team. To his great surprise Nino notices that among 
so many people in the hall there is just one person with black hair and brown eyes: that is, Nino himself ! 

The above example clarifies some simple and yet important concepts. At the beginning, Nino considered 
the team boss from Finland an outlier, on the basis of the statistical evidence that all the people he had 
ever seen had a different complexion. But at the end of the story, on the basis of the new observations he 
has acquired in the hall, the team boss no longer appears to Nino as a strange individual. Thus, the story 
firstly illustrates that outliers may be singled out precisely because there is a second set of data determining 
their abnormality, which below will be called outlier witness. Hence, there is no outlier without a witness. 
Moreover, the fact that at the end Nino himself appears to be an outlier with respect to his new colleagues 
sheds some light on another aspect of outlier detection. Indeed, the story also indicates that an individual 
might well be an outlier in one context, i.e., with respect to a given set of observations, while not being so 
in a different one. It follows that while looking for outliers in general no assumption can be made on the 
existence of properties characterizing outliers "per se". Rather, outliers are to be detected only on the basis 
of observations to hand, by eventually singhng out some properties standing out for their abnormality. 

In the short story, abnormality is just the result of a kind of implicitly computed statistic, which is 
precisely what is commonly assumed by most of the approaches in the literature making use of "quantitative" 
aspects of the observations only. However, while looking at a set of observations to discover outliers, it often 
happens that there are some "quahtative" descriptions of the domain of interest, encoding, e.g., what an 
expected normal behavior should be. As an example of such a description, which will be called background 
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knowledge in the following, there might be a rule stating that people from Southern Italy have black hair and 
brown eyes, thereby precisely encoding Nino's observations about people from his place. This description 
might be, for instance, derived by an expert and might be formalized by means of a suitable language for 
knowledge representation, as shown in the following example, where logic programs under the stable model 
semantics are considered. 

Example 1. Suppose that during a visit to Australia you notice a mammal, say Donald, that you classify as 
a platypus because of its graceful, yet comical appearance. However, it seems to you that Donald is giving 
birth to a young, but this is very strange given that you know that a platypus is a peculiar mammal that 
lays eggs. Formalizing this scenario as an outlier detection problem is simple. Indeed, observations can be 
encoded as the facts {Mammal (Donald), GiveBirth(Donald), Platypus (Donald)} and the additional 
knowledge by means of the following logical rule: 

Platypus (X) <— Mammal (X), not GiveBirth(X). 

It is worthwhile noting that if you had not observed that Donald was a platypus, you would not have 
inferred such a conclusion, given your background knowledge and the fact GiveBirth(Donald). However, 
if for some reason you have doubted the fact that Donald was giving birth to its young, then it would come 
as no surprise that Donald was a platypus. Therefore, the fact GiveBirth(Donald) is precisely recognized 
to represent an outlier, whose anomaly is indeed witnessed by the fact Platypus(Donald). < 

Though still very simple, the example above conveys some of the relevant features occurring in the ex- 
ploitation of a background knowledge for detecting outliers in a given set of observations. First, the example 
evidences once again that abnormahty of outliers is not defined in "absolute" terms, i.e., there is no explicit 
encoding for the exceptions in the theory. In particular, outliers show up as kinds of anomalous individuals 
that do not necessarily lead to a (logical) conflict, so that their isolation caimot be carried out in general by 
exploiting inconsistencies between observations and the knowledge at hand. For instance, with the encoding 
of the background knowledge in Example 1 , there is no problem in assuming that all the three observations 
hold: the logical rule is not inconsistent with the observations, and a model in fact exists. In this respect, 
there is no exphcit need to revise the knowledge and/or the observations. 

Indeed, in outlier detection, the fact that an individual is an outlier has to be witnessed by a suitable as- 
sociated set of observations, the outlier witness, which shows why the individual "deviates" from normality. 
While in traditional (data-mining) approaches an individual to deviate is formalized on the basis of some 
sort of statistics on the data, making use of quaUtative descriptions about the domain of interest allows us to 
exploit a rich, logic-based framework. 

Specifically, the choice here is to assume that outlier witnesses are sets of facts that are "normally" 
(that is in the absence of the outliers) explained by the domain knowledge which, in turns, entails precisely 
the opposite of the witnesses whenever outliers are not singled out. In this way, witnesses are meant to 
precisely characterize the abnormality of outliers with respect to both the theory and the other data at hand. 
For instance, in Example 1, Platypus (Donald) is a witness for the outlier GiveBirth(Donald), since 
Platypus (Donald) cannot be predicted by the theory given that one trusts in the fact GiveBirth(Donald), 
but is immediately entailed as soon as GiveBirth(Donald) is thrown out. 

It is worthwhile noting that from this perspective, outlier detection may be abstractly seen as a form 
of diagnostic reasoning where diagnosis of anomalies has to be identified on the basis of some kind of 
disagreement with the background knowledge and must be supported by some further evidence in the data 
(such evidences being encoded in witness sets) — connections with this form of reasoning will be further 
clarified in the following with some examples (cf. Example 3) and discussions. 



2 



Clearly enough, detecting outiiers by exploiting a logical characterization of the domain of interest is 
generally more complex than it appears from Example 1 . Therefore, in the presence of complex and richer 
background knowledge, some automatic mechanisms for outUer identification via logic theories is defini- 
tively needed. Formalizing these kinds of mechanism, discussing their complexity and providing computa- 
tion algorithms is, in a nutshell, the contribution of this paper. In particular, the formalization here is carried 
out by exploiting logic programs under the stable model semantics. This means that the background knowl- 
edge, that is, what is known in general about the world (also called in the following rule component), and 
the observations, that is, what is currently perceived about (possibly, a portion of) the world (also called in 
the following observation component), are respectively encoded in the form of a logic program and a set of 
facts under the stable models semantics. 

In order to make the framework clearer, before detaihng the major contributions of this paper, more 
examples of outlier detection are provided and relationships with some related reasoning mechanisms are 
discussed next. 

1.2 Examples of Outlier Detection 

Considering outlier detection problems in the presence of background knowledge may have several use- 
ful applications. For instance, the concept of the outlier in database applications might be quite natural in 
encoding such important tasks as maintaining database integrity through updates as discussed below. 

While designing relational database applications, relations are often equipped with some kind of in- 
tegrity consttaints in order to enhance their expressiveness. In normal operative conditions databases are 
assumed to be consistent, i.e., to satisfy all the integrity constraints. However, it may happen that this is 
not the case, especially when data is the result of the integration of several autonomous sources such as 
in datawarehouse and data integration contexts. In fact, the problem of identifying and handling inconsis- 
tencies in databases has recently received a lot of attention (cf. [65, 14, 38, 62, 20]). Actually, most of the 
proposed approaches define constraints using suitable fragments of first order logics, and mainly focus on 
very simple constraints on data, such as functional dependencies and inclusion dependencies (see, e.g., [2]). 
In those cases, constraint violations may be identified by means of simple queries over the database at hand. 
However, more sophisticated forms of constraints modelling specific knowledge of the application domain 
are left generally unexpressed. These kinds of constraint may encode, e.g., organizational rules or routinely 
appUed praxis, which should be formaUzed in order to construct a more precise modelling of the domain. In 
these cases, the problem is that there is no obvious way for checking database integrity. 

Outlier detection can in fact be used to examine database integrity by allowing for more sophisticated, 
appUcation-oriented forms of constraints. If an abnormal property is discovered in a database, i.e., a violation 
of some constraint, the data source which reported this observation would have to be double-checked. An 
example follows. 

Example 2. Consider a bank B. The bank approves loan requests put forward by customers on the basis of 
certain policies. As an example, assume that loan approval policies prescribe that loans for amounts greater 
than 50K Euro have to come along with an endorsement provided as a guarantee by a third party. Information 
about approved loan requests is stored in a number of relational tables, that are: 

- ReqLoan(Loan Id, Customer, Amount), that records loan requests; 

- Endorsement(Loan Id, Party), that records the guaranteeing parties for loan requests for more 
than 50K Euro; 

- Approved(Loan Id), that records approved loan requests. 
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Fig. 1. Example instance of the bank database. 



Moreover, the bank stores information about unreliable customers in the table Unreliable(Customer) 
collecting data from an external data source, such as, for example, consulting agencies providing information 
about the financial records of individuals and companies. In Figure 1, an instance of the bank database is 
reported. 

With this knowledge to hand, the bank policy concerning loan approvals can be easily encoded using a 
logical rule Uke the following: 

Approved(L) <— ReqLoan(L, C, A), A > 50.000, Endorsement (L, P), not Unreliable(P). 

According to this knowledge, and by using the straightforward and well-known correspondence be- 
tween relational tuples and logical facts (e.g., the tuple (/i) in relation Approved corresponds to the fact 
Approved(ii)), it might be noticed that there is something strange with the loan l\ in Figure 1. In fact, since 
the loan has been approved, the party pi is supposed to be reUable. However, this is not the case, as emerges 
by looking at the database provided by the consulting agency (see table Unreliable). 

Notice that if the fact Approved(Zi) were dropped, the exact opposite would have been concluded, 
namely that the loan li is not in the relation APPROVED, i.e., according to the normal behavior of the bank, 
the loan li is not to be approved. Furthermore, if both the facts Unreliable(pi) and Approved(Zi) were 
dropped, again it would be concluded that h might be approved. This implies that the loan request l\ not 
being approved is a consequence of the fact that pi is not reliable, and hence Unreliable(pi) is an outlier 
witnessed by Approved(ii). This entails that, in the above scenario, the bank has not trusted the consulting 
agency since the bank has actually decided that pi is to be considered indeed reliable. < 

Further potential applications of the logic-based framework for outlier detection comes in the context 
of enhancing the reasoning capabilities of autonomous agents. Indeed, recall that computational logics have 
been successfully exploited recently in the context of agent systems applications, since it is quite an effective 
and powerful way for both modelling and prototypically implementing several forms of reasoning schemes 
in such systems [45,85,83]. For instance, abductive logic programming approaches have been fruitfully 
used to allow an agent to make hypotheses about the outer world and causes of observable events (e.g, 
[21]), modal logic operators have been exploited to describe and realize agent behavior and to put it into 
relationship with other agents in an agent society (e.g., [74]), and inductive techniques have been proposed 
to enable an agent to "learn" from experience (e.g, [15]). 

While exploiting different syntaxes and specific characteristics of the domains of interest, most of these 
proposals share the same basic structural vision of the agent, which is often assumed to have its own, 
trustable background knowledge about the world that is encoded in the form of a suitable theory. Then, after 
some observations have been obtained describing the actual status of the outer environment, the agent might 
try to achieve its application goals by performing some suitable reasoning tasks on the basis of both its 
background knowledge and the observations. In usual agent operative conditions, one may assume that the 
background knowledge and the observations logically harmonize. However, situations may occur for which 
this is not the case. In such circumstances, it is desirable that the agent that has noticed some mismatch is 
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n : down(X) ^ computer(X), not predecessorUp(X). 

r2 : predecessorUp(X) ^wired(Y, X), up(Y). 

ra : up(X) ^ computer (X), not down(X). 

r4 : computer (s). computer (a). ••■ computer(t). 

rs : wired(s, a). ■■■ wired(g, t). 

re : up(s). 

witness set 

Fig. 2. Computer network example. 

also able to revise its background knowledge by incorporating the new evidence of the world. In particular, 
in some cases the background knowledge and observations mismatch might be caused because there is 
something "wrong" or "anomalous" with the observations themselves. This might happen, for instance, as 
a consequence of noise occurring while sensing the outer environment or even because of the existence of 
malicious agents supplying wrong information, and outlier detection may be fruitfully exploited to identify 
such situations. 

Clearly, the isolation of such anomalous observations offers a potentially useful source of additional 
knowledge that might be exploited to better support the interaction among agents or simply to optimize the 
way an agent pursues its goals. Indeed, outliers might be of a great interest, for they put into evidence the 
presence of some situations possibly requiring an alert or a quick reaction by the agent. A detailed example 
follows. 

Example 3. Consider an agent A that is in charge of monitoring the connectivity status of the computer 
network J\f shown on the right of Figure 2. The agent's background knowledge is modelled by a logic 
program PJ^'', which is used by A for assessing whether the computer s is connected to t. Program Pj^" {rule 
component) consists of the rules shown on the left of Figure 2. In P^^^, each computer, say X, is represented 
by an atom computer(X), and connections among computers are denoted by means of the binary relation 
wired. Moreover, for each computer X, the fact down(X) (resp. up(X)) encodes that X is offline (resp. online). 

The topology of the network is encoded into the facts in and rs. The meaning of the other rules in 
is as follows. Rule ri defines the unary predicate down in a way that down(X) is true if there exists 
no predecessor of the computer X (in the network) which is up; having a predecessor that is up is encoded 
in the unary predicate predecessorUp defined in rule r2 that tells that a computer Y is a predecessor of X 
in the case wired(Y, X) holds, and therefore PredecessorUp(X) is true if there is a computer Y such that 
wired(Y, X) is true and such that Y is up. Rule says that a computer is up if it is not down. Finally, the 
fact in rg states that the source s is known to be up. 

In order to monitor the net, A observes the actual status of each computer X in the net. Hence, the agent 
A has such an evidence modelled in the observation component by means of a set of facts, say Pf\ over 
the predicates down and up. 

It is important to note that program encodes the normal behavior of the network and does not 
explicitly account for possible anomalies. Its intended meaning is that, in usual operative conditions, a 
computer X is down if and only if there is no path in J\f connecting s to X only going through computers 
that are up. Thus, it is sufficient that any such path exists to have X be supposedly observed up. Armed with 
this knowledge, the agent is interested in singling out the observations that are anomalous according to the 
"normal" behavior of the system, modelled by P^^^. 
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Assume now that, for instance, P^^^ comprises the facts {dowii(a), up(b), dowii(c), up(d), 
up(e), up(f ), up(g), up(h), up(t)} — in the figure, the computers observed to be down are marked in bold. 
It is important to note that the program on the left of Figure 2 is stratified and, therefore, a stable model 
[42] always exists no matter what observations are added to it. Thus, no conflict (in the sense of classical 
inconsistency) may arise for the program P^^ and the observations in P^^. However, two strange things 
indeed come into play with the observations. 

First, down(a) comes true in P^*^^, which is not predicted by PJ^. This observation may be clearly 
viewed as "unexpected" according to the background knowledge, even though it formally does not lead to an 
inconsistency. However, we do not have any argument for classifying it as a wrong observation. Indeed, this 
kind of disagreement may actually have happened for several reasons, for instance, because the background 
knowledge just encodes a partial description of the world and the observations involve some aspects which 
are not dealt with therein. E.g., the background knowledge may not encode the fact that a computer may be 
down also if some internal failure occurred, which does not depend on connectivity in the network. 

In addition, there is a second strange aspect in the observations, which is the fact of c also being down. 
In this second case, however, the abnormality of this circumstance is also witnessed by some other observa- 
tions. Specifically, if A did not observe that computers d, e, f , g, and t are up, it would have inferred exactly 
the opposite conclusions (that are that d, e, f , g, and t are down) by exploiting its background knowledge, 
since the failure of c suffices for breaking the s-t connectivity. Thus, in this second case, there is some 
further evidence that the observation down(c) is wrong. It follows that without any additional knowledge 
about the system, by reasoning this way and given the support of the observations on computers d, e, f , g, 
and t, a quite rehable diagnosis can be made. 

In the framework that follows, the computer c being down is precisely recognized to represent an outUer, 
while the set W is an outlier witness, i.e., a set of facts which can be indeed explained by the rule component 
if and only if the outliers are not trusted in the observations. < 

1.3 Comparison with Other Reasoning Tasks 

In the light of the informal discussion and of the examples presented so far, one may wonder what kinds of 
connection outlier detection has with other well-known reasoning tasks. In particular, it should be evident 
that outlier detection shares some features with some well-known and studied problems in AI literature, such 
as belief revision and diagnosis. OutUer detection can be indeed abstractly seen as a form of revision since 
its ultimate goal is to logically harmonize the background theory with the observations to hand. Moreover, 
it can be also perceived as a form of diagnosis, because the main interest is in individuals that do not behave 
as predicted from the background knowledge. 

Next the relationships with both belief revision and diagnosis are discussed. Specifically, given that 
diagnosis cames in different forms in the literature, two of the main approaches pursued in the AI community 
(cf. [72]) will be focused on, that are consistency-based and abductive diagnosis. 

(Consistency-Based) Diagnosis. Diagnosis can be defined as the problem of finding what is wrong with 
some possibly malfunctioning systems based on knowledge about the design of that system and observations 
about its actual behavior (cf. [72, 71]). 

In the consistency-based approach to diagnosis, there is no knowledge as to how malfunctioning occurs 
and manifests itself, and only the "normal" behavior is axiomatized [43,26,76]. Therefore, diagnosis con- 
sists in isolating components that are not consistent with all other components acting normally. Formally, 
there is a set H of hypothesis (which in most of the formalizations comes in the form of abnormality- 
defining predicates), a background theory T, and a set O of observations; then, the problem is to single out a 
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set A C H so that TuOuAis consistent in the classical sense. For instance, in [76], a hypothesis ^ab{C) 
is introduced for each component C that can possibly be faulty, and what follows from the assumptions of 
normality is cast into rules. 

It appears that consistency-based diagnosis has a relationship with outlier detection as long as in both 
forms of reasoning one is interested in singling out "abnormality" in the knowledge at hand. However, the 
framework of outlier detection does not actually fit the diagnosis framework. First, there is an important 
difference in the encoding, in that, in outlier detection the background theory is not required to explicitly 
account for abnormal predicates and, more importantly, outliers are not required to be conflicting (in a 
logical sense) with the background theory, as evidenced in the previous examples. 

Actually, one may be interested in assessing some more structural differences between these forms of 
reasoning, by assuming that these syntactic issues can be faced by means of some suitable rewriting. For 
instance, one can assume that the background knowledge at hand may be revised in order to explicitly 
model the occurrence of anomalous observations, e.g., by means of integrity constraints. Even though there 
are no immediate translation mechanisms between the two frameworks in general — the translation in fact 
depends on the semantics of the application — this may be particularly simple in some situations, such as 
in our network example, where the encoding of the knowledge (that currently prevents up(X) and down(X) 
from being both true) might be changed by adding a constraint <— up(X), dowii(X). 

Then, it can be assumed that all the observations constitute the hypothesis for the problem, so that by 
consistency-based diagnosis bunches of observations conflicting with the theory can be singled out. How- 
ever, at this point, the differences between the two frameworks emerge more clearly; indeed, the problem of 
outlier detection is not yet reduced to finding such observations that somehow "contradict" the background 
knowledge, because there are observations that, though different from what prescribed by the background 
theory, are not outliers. This has been made evident in Example 3 above. Indeed, after a proper encoding 
stating that whenever reachable, a computer must be up, one could derive that the observations down(a) and 
dowii(c) are both faulty. However, it has already been noticed that dowii(a) cannot indeed be considered 
an outUer because there are no other observations supporting this claim. Conversely, down(c) is an outlier, 
given that there are five observations (namely, up(d), up(e), up(f ), up(g), and up(t)) that support the fact 
that c is up. 

From this perspective, outliers might abstractly be seen as faulty observations in some kinds of diagnosis 
whose reliability is further evidenced by the witness set, which would also have been considered faulty in 
the case where outfiers are not thrown out. This extra criterion is responsible for greater reliability of such 
diagnosis, since the anomalies are not just inferred from the background knowledge (which can be in fact 
incomplete and lead to misclassifications, as it would be for dowii(a)), but are further evidenced by the data 
in itself. 

Abduction and Diagnosis. In the abductive approach to diagnosis [58, 59, 18] there is a description of the 
system to be diagnosed, observations (symptoms), and possible reasons (faults) for the observations. The 
aim is to single out faults that may be the actual cause for the symptoms to hand. Formally, again the sets H, 
T and O are given, and a subset Z\ C if is sought that combined with T entails O. Intuitively, observations 
in O are assumed to be trustable, while the hypothesis may be revised. 

Abduction was studied by Pierce [69]. Since then, it has been widely recognized as an important prin- 
ciple for common-sense reasoning, a powerful mechanisms for hypothetical reasoning in the presence of 
incomplete information, and a solid framework for modelling practical applications. Abduction has been 
also investigated in logic programming (see, e.g., [27] and the references therein). In this context, the most 
influential definition is due to Kakas and Mancarella [52], but several other approaches have been also pro- 
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posed both from proof and model-theoretic perspectives (e.g, [64, 22, 53, 28, 31]). Moreover, extensions to 
settings where preferences are exploited while finding explanation (abduction with penalization) have been 
studied for both classical logics [30] and logic programming [70]. 

While trying to encode outlier detection in terms of abduction, the main problem is that in the former 
setting there is no exphcit distinction between trustable observations and revisable hypothesis. Specifically, 
all the observations are revisable, i.e., they may be faulty, and it is not known in advance which of them 
should be granted, i.e., the set O is generally unknown. Therefore, the translations appear to be even less 
immediate than for consistency-based approaches. 

One possibility to build the translation might be to exploit the fact that in outher detection it is known 
that singling out an outlier has the effect of justifying the witness set, which can be eventually seen as the set 
of trustable observations. Therefore, if this witness set, say W, was known in advance, the problem might 
be encoded in a standard abduction framework by assuming all the observations but W to be revisable; then, 
explanations for W are in fact outliers. However, since W is in general not known, some efforts must be 
dedicated to discover it. 

Again, the activity of identifying the witness sets constitutes the main source of computational complex- 
ity in outlier detection problems as well as their main distinguishing characteristics with respect to abductive 
problems. 

Belief Revision. In such scenarios where an agent background theory mismatches with a set of observations, 
it is important to revise the background knowledge, say T, by incorporating that new evidence O gained 
about the world. This process is generally known as belief change in the literature, and represents an active 
area of research in both Philosophy and Artificial Intelligence. 

One of the best known theories of rational belief change is the AGM theory of belief revision originated 
by Alchourron, Gaerdenfors, and Makinson [4] and further developed by Gaerdenfors [39] and Alchourron 
and Makinson [5,6]. Whenever O is consistent with T, the revision of T according to O, denoted by T * O, 
is the set of all the logical consequences of TU {O}, denoted by T-l- O. Otherwise, i.e., if O is not consistent 
with T, T * O is defined as (T — -lO) + O, where T — -lO is the contraction of T according to ^O, that is the 
set of all maximal subsets of T not entailing -lO. This approach is the core of AGM theory, which eventually 
exploits a set of postulates whose aim is to characterize the intuition of minimal change. A number of authors 
have favored the postulation approach, and subsequent works concerned extensions and refinements of the 
basic paradigm, namely the distinction between belief revision and belief update, work on iterated belief 
revision, and the use of epistemic states [54, 36, 37, 25]. 

A criticism raised about the basic paradigm concerns the success postulate, i.e., the assertion that the 
agent beUeves the most recent thing he learns. Specifically, [13] noticed that this postulate is undesirable 
in the case when an agent observation may be itself imprecise or noisy (which is precisely the case we are 
interested in studying in the outher detection framework), and proposed a model of belief revision under 
no obfigation to incorporate observed propositions into the current beUef set. Contextually, different mod- 
els of non-prioritized beUef change have been proposed in which no absolute priority is assigned to the 
new information due to its novelty [47]. These approaches can be grouped into three main categories: de- 
cision-i-revision, integrated choice, and expansion-i-consolidation. Decision+revision approaches first decide 
whether to accept or reject input O, and if the input is accepted, some of the beliefs in T are given up in 
order to incorporate O while retaining consistency. For example, screened revision [66] introduces a set A of 
core beliefs that are kept immune from revision, and the belief set is revised only if O is consistent with the 
set Ar\T. Integrated choice approaches perform the two above-mentioned steps simultaneously. This can 
be achieved by means of such choice mechanisms as epistemic entrenchment [39] or spheres-based revision 
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[44]. Finally, expansion+consolidation approaches add O to the belief set T, and then make the belief state 
consistent by deleting either (part of) O or some original beliefs [46]. 

Clearly, the problem of outUer detection is related with such approaches to beUef revision, given that 
its ultimate aim is to remove some kind of disagreement between the rule component and the observations. 
Specifically, since in the outher detection setting observations are to be doubted, while taking the background 
knowledge for granted, two policies might be adopted in order to obtain outUer detection by revision: either 
a model of non-prioritized beUef change might be directly exploited, or the roles of background knowledge 
and observations in the revision can be supposedly inverted, so that observations constitute the "initial" 
knowledge base and the background knowledge is used to revise it; then, the success postulate guarantees 
that the background theory remains unchanged and only observations are possibly given up. 

However, both the above solutions strongly rely on the fact that a revision has to be made as soon as 
merging observations with the theory to hand leads to inconsistencies, which is in fact the starting point 
of any kind of behef revision. But this is not the case for outlier detection problems, where the notion 
of inconsistency (in the classical sense) plays no role and where the mismatch is given by a rather subtle 
form of disagreement between some observations and the other data at hand. Therefore, as in the case of 
consistency-based diagnosis, a preliminary step for carrying out outlier detection via revision is to encode 
the rule component in a way that explicitly accounts for the isolation of conflicts; but, this is not going to be 
obvious in all the circumstances. 

However, the most important difficulty in the encoding lies again in the additional requirement for an 
outlier to be witnessed by some other set of observations. In this respect, outlier detection might be accom- 
modated in a framework for computing some kinds of preferred revision, in order to avoid the revision of 
observations that are not outliers, as it would occur in Example 3 with the observation down(a). To this aim, 
if a witness set W had been known in advance, one would have been able to define an encoding leading to 
an inconsistency whenever W would not be entailed by the theory, so that the role of the revision would 
have been precisely to single out outhers having W as a witness set. Therefore, again, isolating the witness 
sets appears to be a distinguishing characteristic of outher detection problems. 

1.4 Contribution and Plan of the Paper 

It is worth pointing out that outlier detection problems come in several different guises within settings that 
have been mainly investigated in the area of Knowledge Discovery in Databases and, recently, they have 
also emerged as a knowledge representation and reasoning problems, in the context of default logic [7] — 
refer to Section 7 for a thorough analysis of related literature. 

In this paper, the definition provided in [7] is basically followed for identifying anomalies in observa- 
tions, but the concept of outliers is formally stated in the context of logic programming under the stable 
model semantics for several reasons. Firstly, logic programs have been, in fact, proved to be a powerful 
tool for modelhng reasoning capabihties in multi-agent systems [45, 85, 83]. Secondly, outlier detection 
problems formalized using the logic programming paradigm have a natural translation into standard logic 
inference problems, thus making the framework presented here quite easily implemented on top of any ef- 
ficient inference engine (such as GnT [49], DLV [61], Smodels [67], and ASSAT [63]). Thirdly, the new 
formalization of outher detection presented here is better suited as the basis for generahzing the outlier 
detection problems formahzed in [7]. In more detail, the contributions of this paper are summarized below. 

> The notion of outlier in the context of logic programming-based Knowledge systems is formally defined. 
In particular, the definition introduced in [7] is generalized by allowing an outlier to consist of a set 
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of observations (modelled as ground facts) rather than of a single observation. This generalization is 
significant because there are actual situations where only non-singleton outliers can be detected. 

> Outlier detection problems are investigated in the context of skeptical semantics (the only one considered 
in [7]) as well as in the context of brave semantics. It should be noted that this does not simply add some 
formal details to the framework, since from the semantical viewpoint, referring to brave or skeptical 
reasoning in detection problems significantly changes the role that is played by outhers. 

> The computational complexity of some natural decision outUer detection problems is thoroughly investi- 
gated for the case of propositional logic programs. The results of this study (both for brave and cautious 
reasoning) are summarized in Figure 3. It can be noted that the complexity figures range from P to 
depending on the specific detection problem considered. It is also worth pointing out that, differently 
from what happens with most logic-based reasoning frameworks, in most of cases considered brave and 
cautious semantics induce the same complexity. Furthermore, the complexity of computing outliers is 
analyzed. 

> The data complexity of some basic outlier detection problems is investigated. This analysis is particu- 
larly useful in the context of outlier detection in database applications, where one is usually interested 
in understanding how the complexity of a problem varies as a function of the database size, and the 
rule component is assumed to held fixed, or anyway less frequently varied, as it usually encodes a set 
of constraints on the database schema. Hence, in the data complexity scenario, other than continuing 
investigating propositional programs, also non-ground logic programs are considered. 

> Several cost-based generalizations of outlier detection problems are formaUzed, accounting for a num- 
ber of interesting situations in which the computation of just any outlier is not what is really sought. 
Moreover, how this generalization influences the complexity of outher detection is also studied — see 
Figure 4. 

> The basic outUer detection framework assumes that observations (and, hence, outhers) come into play 
as sets of facts encoding some aspects of the current status of the world. However, there are situations 
where it would be desirable to have observations encoded as a logical theory (this might be required, for 
instance, for agents to be able to reconstruct an internal and possibly complex description of the outer 
environment by learning logical rules and then reasoning on the basis of them, e.g., about the behaviors 
of other agents). To this aim the concept of outlier is extended to be denoted, in general, by a set of 
logical rules and facts. The computational complexity of the problems arising in this extended setting is 
also accounted for in the paper. 

> In order to ease fast prototyping of outher-based reasoning frameworks, sound and complete algorithms 
for transforming any outlier problem into an equivalent inference problem under stable model semantics 
are presented. The transformations can thus be used for effectively implementing outlier detection on 
top of any available stable model solver (e.g., [49, 61, 67, 63]). 
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The rest of the paper is organized as follows. Section 2 contains some preliminaries on logic programs 
and on the main complexity classes dealt with in the paper. The basic definition of outliers under both 
brave and cautious semantics is introduced in Section 3, where the complexity of some outlier detection 
problems is also investigated. Section 4 proposes a generalization of the basic framework in which minimum 
size-outliers are sought, and studies how this additional requirement influences the basic difficulty of the 
detection problems. A different kind of extension, that is, the possibility of having observations encoded 
as logic theories is discussed in Section 5. Then, Section 6 illustrates a sound and complete rewriting for 
implementing outlier detection on top of stable model engines. Finally, Section 7 discusses some related 
work and, in Section 8, conclusions are drawn. 

2 Preliminaries 
2.1 Logic Programs 

A term is a constant or a variable. An atom is of the form p(ti, t^) where p is a fc-ary predicate symbol 
and 1 1 , . . . , tk are terms ; in the case /c = 0, the atom is called propositional letter and parenthesis are omitted. 
A literal is an atom a or its negation not a. 

A rule r is a syntactic clause of the form: a <— bi, • • ■ , bk, not ci, • • • , not Cn-, where A;, n > 0, and 
a, bi, • ■ ■ , bk, ci, • • • , Cn are atoms. The atom a, also denoted by h(r), is the head of r, while the conjunction 
bi, . . . , bk, not ci, • • • , not Cn, also denoted by b(r), is the body of r. A rule with n = is called positive. 
A rule with an empty body (i.e. n = = 0) is called a fact is omitted). 

A logic program (short: LP) P is a finite set of rules. P is positive if all the rules are positive. P is 
stratified, if there is an assignment s(-) of integers to the predicate symbols in P such that for each rule r 
in P the following holds: if p is the atom in the head of r and q (resp. not q) occurs in r, then s{p) > s{q) 
(resp. s{p) > s{q)). Moreover, P is propositional if all the atoms in it are propositional letters. 

The Herbrand Universe Up of a program P is the set of all constants appearing in P, and its Herbrand 
Base Bp is the set of all ground atoms constructed from the predicates appearing in P and the constants 
from Up. A ground term (resp. an atom, a literal, a rule or a program) is a term (resp. an atom, a literal, a 
rule or a program) where no variables occur. A rule r' is a ground instance of a rule r, if r' can be obtained 
from r by consistently replacing variables occurring in r with constants in Up. By ground{P) the set of all 
ground instances of the rules in P is denoted. 

In the following, background knowledge bases encoded by means of ground programs, or, equivalently, 
by means of propositional logic programs'* are (mainly) considered. While this is a rather natural setting 
most often adopted in the literature for introducing and discussing the complexity of various basic reasoning 
tasks, it is relevant to note that the proposed outUer detection framework is general enough to cope with rule 
components encoded also by means of non-ground programs (the reader may check that no modification 
at all is required in the basic definitions). This is, for instance, the case of our running examples, where 
the use of variables has been pursued to keep the encoding compact and to help the reader's intuition in 
understanding the main features of outUer detection problems. Clearly, in these cases, as far as complexity 
studies are concerned, given a logic program P with variables, the input to our reasoning tasks is to be 
understood as its ground version ground{P). Notably, the semantics of a program P is in fact precisely 
defined in terms of its ground version, as discussed below. 

* Indeed, any ground program P may be equivalently seen as a propositional one, by replacing each atom of the form p(ti , . . . , tk) 
where each ti (1 < i < fc) is a constant, with the propositional letter p*'' "'*'". 
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An interpretation of P is any subset of Bp. The truth value of a ground atom L w.r.t. an interpretation /, 
denoted valuei{L), is 1 (true) if L G / and (false) otherwise. The value of a ground negated literal not L 
is 1 — valueiiV). The truth value of a conjunction of ground literals C = Li, . . . , Ln is the minimum over 
the values of the Li, i.e. valuej{C) = min{{valuej (Li) | 1 < i < n}). If n = 0, then valuei{C) = true. 
A ground rule r is satisfied by / if valuei{h{r)) > valuei{h{r)). Thus, a rule r with empty body is 
satisfied by / if valuei{h.{r)) = true. An interpretation M for P is a model of P if M satisfies all rules in 
ground(P). 

The minimal model semantics assigns to a positive program P its unique minimal model M.M^{P). A 
model M for P is minimal if no proper subset of M is a model for P. For a general program P, the stable 
model semantics [41] assigns to P the set SM.{P) of its stable models defined as follows. Let P be a logic 
program and let / be an interpretation for P. Then, the reduct of P w.r.t /, denoted by P^, is the ground 
positive program derived from ground{P) by (1) removing all rules that contain a negative literal not a in 
the body and a € /, and (2) removing all negative literals from the remaining rules. An interpretation M is a 
stable model for P if and only if M = M.M. (P^). It is well known that stable models are minimal models 
and that stratified logic programs have a unique stable model (see, e.g., [24]). 

Let W be a set of facts. Then, program P bravely entails W (resp. -iW), denoted by P W (resp. 
P 1=6 -iW), if there exists M G SM.{P) such that each fact in W is evaluated true (resp. false) in M. 
Conversely, P cautiously entails W (resp. -iW), denoted by P |=c >V (resp. P ~'W), if for each model 
M G SM.{P), each fact in W is true (resp. false) in M. Clearly, for a positive or stratified program P, 

P K wiff^ H w. 

2.2 Computational Complexity 

Some basic definitions about complexity theory are recalled next. The reader is referred to [68, 50] for more 
on this. 

Decision problems are maps from strings (encoding the input instance over a suitable alphabet) to the 
set {"yes", "no"}. A (possibly nondeterministic) Turing machine M answers a decision problem if on a 
given input x, (i) a branch of M halts in an accepting state iff x is a "yes" instance, and (ii) all the branches 
of M halt in some rejecting state iff x is a "no"instance. 

The class P is the set of decision problems that can be answered by a deterministic Turing machine 
in polynomial time. The classes and n^, forming the polynomial hierarchy, are defined as follows: 
= il^ = P and for all k > 1, = NP^^-i, = P^^-i, and 77f = co-^f where co-^f 
denotes the class of problems whose complementary version is solvable in Ef^, and where E^ (resp. A^) 
models computability by a nondeterministic (resp. deterministic) polynomial-time Turing machine which 
may use an oracle that is, loosely speaking, a subprogram, that can be run with no computational cost, for 
solving a problem in E^_^. The class Ef of decision problems that can be solved by a nondeterministic 
Turing machine in polynomial time is also denoted by NP, while the class of decision problems whose 
complementary problem is in NP, is denoted by co-NP. The class Dj^, k > 1, is the class of problems 
defined as a conjunction of two independent problems, one from E^ and one from U^, respectively. Note 
that, for all k > 1, E^ C C E^^^. 

Functions (also computation problems) are (partial) maps from strings to strings, which can be computed 
by suitable Turing machines, called transducers, that have an output tape. In particular, a transducer T 
computes a string y on input x, if some branch of the computation of T on x halts in an accepting state and, 
in that state, y is on the output tape of T. Thus, a function / is computed by T, if (/) T computes y on input 
X iff f{x) = y, and (ii) all the branches of T halt in some rejecting state iff f{x) is undefined. 
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In this paper, some classes of computation problems will be referred to which are illustrated next (see, 
also, [57, 79]). The class FP is the set of all the polynomial time computable functions, which are functions 
computed by polynomial-time bounded deterministic transducers. More generally, for each class of decision 
problems, say C, FC denotes its functional version; for instance, FNP denotes the class of functions com- 
puted by nondeterministic transducers in polynomial time, denotes the class of functions computed in 
polynomial time by nondeterministic transducers which use an NP oracle, and FA2 denotes the functions 
computed, in polynomial time, by a deterministic transducer which uses an NP oracle. In the following some 
further classes will also be referred to, which will be defined when needed. 

In conclusion, the notion of reduction for decision and computation problems should be recalled. A 
decision problem Ai is polynomially reducible to a decision problem A2 if there is a polynomial time 
computable function h such that for every x, h{x) is defined and A\ output "yes" on input x iff A2 outputs 
"yes" on input h{x). A decision problem A is complete for the class C of the polynomial hierarchy iff A 
belongs to C and every problem in C is polynomially reducible to A. Moreover, a function /i is reducible to 
a function /2 if there is a pair of polynomial-time computable functions hi , /i2 such that, for every x, hi (x) 
is defined, and /i(a;) = h2{x, w) where w = /2(/ii(x)). A function / is hard for a class of functions FC, if 
every f E J^is polynomially reducible to /, and is complete for FC, if it is hard for FC and belongs to FC. 

3 Defining Outliers 

In this section, the notions and the basic definitions involved in our framework are introduced and the main 
problems studied in the paper are described and formalized. 

3.1 Formal Framework 

Let P"^'** be a logic program encoding general knowledge about the world, called rule component, and let 
pobs ^ encoding some observed aspects of the current status of the world, called observation 

component. Then, the structure V = {P"^^-, P°^^), relating the general knowledge encoded in P""'^ with the 
evidence about the world encoded in po^s^ jg ^ rule-observation pair, and it constitutes the input for outlier 
detection problems. 

Indeed, given V, it is interesting to identify (if there is one) a set O of observations (facts in po^s-) 
that are "anomalous" according to the general theory P"^'^ and the other facts in P°^^ \ O. Quite roughly 
speaking, the idea underlying the identification of O is to discover a witness set W C po^s^ jj^^j ^ 
of facts which would be explained in the theory if and only if all the facts in O were not observed. This 
intuition is formalized in the following definition. 

Definition 1 (Outlier). Let V = (pi'is pobs^ ^ rule-observation pair and let O C P°^'' be a set facts. 
Then, O is an outlier in V if there is a non-empty set W C P°^^ with W fl C = 0, called outlier witness for 
O in V, such that: 

1. P{V)w N and 

2. P{V)w,o ^ 

where P(P) = P'^"" U P°^^ P(P)w = P{P) \ W, P{V)w,o = P{V)w \ O, and \= denotes entailment 
under either cautious semantics (\=c) or brave semantics (\=b)- D 

As an example application of the definition above, let us consider again the network of Example 3. 
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Example 4. Let us consider the rule-observation pair Vn = {P^^^ ^'n'^)' where the program P^® 
consists of the rules shown on the left of Figure 2, and comprises the observed facts 

{down(a),up(b),down(c),up(d),up(e),up(f),up(g), up(h), up(t)}. 

Let W be the set {up(d), up(e), up(g), up(f ), up(t)} and O = {down(c)}. Then, it is easy to see that 
P{Pn)w \=b "'VV and P{'Pn)w,o V^h "'VV. Therefore, {dowii(c)} is an outlier in Vn, and W is an outher 
witness for O in Vn (under the brave semantics). Actually, since the program is stratified and there is 
exactly one stable model, it is the case that {down(c)} is an outUer and W its witness also under cautious 
semantics. < 

Let us now take a closer look at Definition 1. First, it is worthwhile noting that the definition is a 
generalization of the one proposed in [7], since an outlier is not constrained to be a literal, but it might 
consists of several individual facts. Accordingly, it was explicitly required that outliers must not overlap 
with their witness sets, in order to avoid situations where a set of facts supports by itself its anomaly. 
As an example of such a situation, consider the rule-observation pair Vq = {Pq^,Pq°^), where P^^ = 
{a <— b. b ^ not c.} and Pq^^ = {a, b, c}. Consider also the sets Oq = {b, c} and Wo = {a, b}. 
Clearly, Oq and Wo satisfy both conditions (1) and (2) in Definition 1, but they are not considered to be 
an outUer and its associated witness, respectively, because they are not disjoint. The problem here is that 
b appears to be an outUer only if this is witnessed by b itself — it is easy to check that Wq = {a} is 
not a witness for Co- Note that, situations such as the one described above cannot occur when outliers 
are singleton sets (cf. [7]) for which disjointness between an outlier and its associated witness is trivially 
guaranteed in order to satisfy condition (2) in Definition 1 . 

The second important feature accounted for in Definition 1 is the possibility of dealing with the two 
different semantics that are commonly adopted in the logic programming framework, which are, brave and 
the cautious semantics. Indeed, the semantics is assumed to be part of the input in the problem of outlier 
detection, and it is therefore fixed after a suitable entailment operator |= in the set {\=b, |=c} is selected. 
After this choice was made by the designer of the rule component, the process of singling out outliers will 
be carried out consistently. Obviously, if the program has a unique stable model (for instance, in the case 
it is positive or stratified as in Example 4), then brave and cautious semantics coincide. For this reason, in 
the rest of the paper, for stratified or positive programs there is no distinction among the semantics - for 
instance, it will be said simply that P entails a set W. 

However, next is shown that in some scenarios one notion of entailment appears to be more appropriate 
with respect to the other, as discussed in the following two paragraphs. 

Cautious Semantics. Let us consider again Example 2, and let us denote by P^q the set of facts shown in 
Figure 1, and by P%% the rule component including just the following rule: 

Approved(L) •<— ReqLoan(L, C, A), A > 50.000, Endorsement (L, P), not Unreliable(P). 

Let Vdb be the rule-observation pair = {P]^%,Pd'b)' observe that the set {Unreliable(pi)} is an 
outlier in Vdb whose witness is {Approved(/i)}. 

Assume now that the database is updated with some new data that the Bank has collected by integrating 
several distributed local databases into a unique datawharehouse, and let Pg*^ be such a modified database. 
Data stored in different sources are not required to satisfy integrity constraints issued on the Bank schema. 
Therefore, after the integration is carried out, it might happen that some integrity constraints are violated. 
Specifically, assume for instance that the first two attributes of ReqLoan are in fact a key for the relation, 
and that the tuple ReqLoan(Zi,ci, 10.000) is added to Pg*^ in the integration process, so that a conflict with 
ReqLoan(Zi,ci, 57.000) occurs. 
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The standard approach in the Uterature for facing the presence of inconsistencies with respect to in- 
tegrity constraints is to carry out some "repair" of the data [9], i.e., to identify a suitable (minimal) set 
of deletion/addition of facts in the database that restore the system to a consistent state. For instance, 
in our example, there are two possible ways for repairing the database, that are either deleting the tuple 
ReqLoan(/i,ci,10.000) or deleting the tuple ReqLoan(/i,ci, 57.000) — let Ri and R2 be the two repairs 
that are computed according to such modifications. Then, whenever a query is issued over the repaired 
database, only consistent answers are retrieved, i.e., answers that are evaluated true with respect to all the 
possible repairs. In fact, several data integration systems supporting consistent query answering have already 
been proposed in the Uterature [38, 62, 20], which exploit a suitable encoding in terms of logic programs that 
ensures a one-to-one correspondence between stable models and repairs for the system. 

As an example, the repair approach can be encoded by means of the program P^'g defined as follows: 

ReqLoaii'(L, C,A) ^ ReqLoaii(L, C, A), ReqLoan(L, C, Al), A Al,not ReqLoan'(L, C, Al). 

Approved(L) ReqLoaii'(L, C, A), A > 50.000, Endorsement (L, P), not Unreliable(P). 

where the first rule takes care of the key on ReqLoan (in particular, it ensures that the primed relation do not 
violate the key on ReqLoan), and the second rule is the rewriting of the original rule-component. 

Then, stable models for the program Pj^% U are in one-to-one correspondence with repairs of 
the data integration system, and therefore consistent query answering coincides with cautious reason- 
ing over the encoding, which is more appropriate than brave reasoning for data integration tasks. Ac- 
cordingly, letting Pdb = {Pdb^Pdb)' we have that P{pDB){kppToved{h)} \=c -'Approved(/;), since 
in both the repairs Ri and R2, Approved(/j) cannot be entailed because pi is unreliable. Moreover, 
-P(^Ds){Approved(Zj)},{Unreiiabie{pj)} docs uot cautiously entail ^Approved(/i). Indeed, at one hand, 
according to repair R2, i.e., by deleting ReqLoan(Zi,ci, 57.000), there is no need at all for deriving 
Approved(Z^) since the loan does no longer require any approval. However, at the other hand, accord- 
ing to repair Ri, i.e., by deleting ReqLoan(^i,ci,10.000), Approved(Zi) can be entailed given that 
Unreliable(p^ ) is being doubted about. Thus, we cannot be completely sure that -iApproved(i^) is 
entailed by the program P(:PDB){Approved(z,)},{unreiiabie(pi)} and, therefore, {Unreliable(pj )} and 
{Approved(/i )} represent an outlier and its associated witness under the cautious semantics. 

Brave Semantics. Let us focus again on Example 3, and consider a slight modification of the encoding in 
P^, where the following rules have been added: 

: wired(c', X) ^ wired(c, X). 
rg : wired(X, c') wired(X, c). 
rg : computer (c'). 

rio : down(c') ^ internalFailure(c'). 

rii : internalFailure(c') <— computer (c'), not properlyWorking(c'). 
ri2 : properlyWorking(c') computer (c'), not internalFailure(c'). 

encoding the fact that another computer c' with the same connections as c is added to the network and that 
c' is an old piece of hardware that is known to be subject to possibly internal faults (rules rio...ri2). 

Let P^^ be such a modified rule-component, and let Pn = {P^^, Pn'^), i-C-, assume that the same set 
of observations of Example 3 has been obtained by agent A, which therefore has no knowledge about the 
current status of c'. 
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Clearly, P{Vn) has two possible models, corresponding to the situation in which c' is either up or down. 
In order to single out any possible anomalous situations brave semantics should be adopted, in this case. This 
is also often assumed in most diagnostic approaches. 

Let W be the set {up(d), up(e), up(g), up(f ), up(t)} and O = {down(c)}. Then, it is easy to see that 
P{'Pn)w \=b indeed, in the case c' is down, none of d, e, f , g, and t can be up. Thus, condition (1) in 
Definition 1 is satisfied. As for condition (2), it is easy to see that P{Vn)w,o V^b Indeed, if dowii(c) 
is not trusted, then all the computers in d, e, f , g, and t will be entailed to be up, no matter what the status of 
c' is. Thus, under the brave semantics, down(c) remains an outlier in this modified scenario. It is worthwhile 
noting that according to the cautious semantics, instead, -i>V is not entailed by P((Pn)v\>, given that there is 
a model (where c' is up) in which nothing anomalous can be singled out in these observations in W. 

Therefore, brave semantics appears to be quite useful in this kind of diagnostic scenario for it allows 
the identification of all the situations which are possibly anomalous rather than focusing only on absolutely 
rehable ones. 

So, there are scenarios where one of the either forms of reasoning is suitably applied. It is therefore 
natural to ask whether there are some particular relationships between the two notions. This section is 
concluded by noting that this is not the case, as can be formally verified by exploiting the asymmetry of 
Definition 1 . Indeed, under cautious reasoning the definition is strict in requiring that for each stable model, 
the witness set W is not entailed by the theory obtained by removing W itself (point 1), but then it just 
requires the existence of a model explaining some facts in W after the removal of the outlier O (point 2). 
Conversely, under brave semantics the definition is loose in the first point, because it requires that W is true 
in at least one stable model, but, for point (2), it requires that each model of P{V)y\;^o entails some facts 
in W. 

It turns out that the two semantics are both of interest, since, intuitively, referring to brave or skeptical 
reasoning in outlier detection problems significantly changes the role that, from a computational viewpoint, 
is played by literals encoding outliers — this role will be further clarified in the following sections, while 
discussing the complexity of outUer detection problems. Moreover, by looking at the example above, it 
even seems that the brave semantics better captures the intuition behind outlier detection in some kinds of 
diagnostic application; in this respect, extending the definition in [7] to encompass both the semantics (rather 
than referring to the cautious one alone) allowed us to look at outlier detection problems from a different 
perspective. 

3.2 Basic Results 

Now that the notion of outUer has been formalized, next the study of the most basic problems arising in this 
setting is addressed. The first basic problem considered here is the problem EXISTENCE defined as follows. 
Given in input a rule-observation pair V = {P^^^, P"*^^), EXISTENCE is the problem of deciding the existence 
of an outlier in V. Clearly, the complexity of EXISTENCE strictly depends on what kind of logic program 
P^^ is. A very simple case is where P"^'^ is a positive logic program. 

Theorem 1. Let V = {P^^^, P°^^) be a rule-observation pair such that P'^^ is positive. Then, there are no 
outliers in V. 

Proof. By contradiction, assume that there is an outlier O C p°bs ^^^j^ witness W C p°t>s -p j^gj pi 
denote the logic program P('P)w,C'- Notice that P' has a unique model, say M. From condition (2) in 
Definition 1 it is known that P' ^ -iW. Hence, it can be inferred that there is u; G W such that w G M, i.e. 
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that P' \= w. Thus, it also holds, for the monotonicity property, that P' U O \= w, i.e. that P{'P)y\; ^ -^W 
and thus W is not a witness, since it would violate condition (1) in Definition 1. □ 



Let us now consider a more involved scenario, in which P'^ is stratified. Even though in logic program- 
ming adding stratified negation does not increase the complexity of identifying the unique minimal model 
with respect to the negation-free case, it is next shown that negation (even in the stratified form) does indeed 
matter in the context of outlier detection. Indeed, the EXISTENCE problem becomes more difficult in this 
case, and even unhkely to be solvable in polynomial time. 

Before proving the results, some basic definitions are introduced that will be used in the proofs. Let L 
be a set of literals. Then, it is denoted by the set {p \ p is a positive Uteral in L}, and by L~ the set 
{p I -ip is a negative literal in L}. Let L be a consistent set of literals. By the truth assignment on the 
set of letters occurring in L is denoted such that, for each literal p G L~^, aiip) = true, and for each 
literal -ip G L~, aiip) = false. Let cr be a truth assignment of the set {xi, . . . , Xn} of boolean variables. 
Then, it is denoted by Lit{a) the set of Uterals {^i, . . . , in}, such that £i is Xj if a{xi) = true and is -iXi 
if = false, for i = 1, . . . , n. Let £ be a literal. Then, it is denoted by p{£) the expression i, if £ is 

positive, and the expression not i', if I is negative and £ = -^£'. 

Theorem 2. Let V = {P^^^, P°^^) be a rule-observation pair such that P""^^ is stratified. Then, EXISTENCE 
is 'HP-complete. 

Proof. (Membership) Given a rule-observation pair V = {P^^'^, P°^'^), it must be shown that there are two 
sets O C P°bs g^j^j^ jj^^j P{V)w 1= (query q') and P{V)w,0 ^ (query q"). P{V) is stratified 
and, hence, has a unique stable model. Moreover, both query q' and q" are P-complete (see, e.g., [24]). 
Thus, a polynomial-time nondeterministic Turing machine can be built solving EXISTENCE as follows: the 
machine guesses both the sets W and O and then solves queries q' and q" in polynomial time. 

(Hardness) Recall that deciding whether a Boolean formula in conjunctive normal form <^ = ci A . . . Ac^ 
over the variables xi, . . . , x„ is satisfiable, i.e., deciding whether there are truth assignments to the variables 
making each clause Cj true, is an NP-hard problem, even if each clause contains at most three distinct 
(positive or negated) variables, and each variable occurs in at most three clauses [40]. Without loss of 
generality, assume <P contains at least one clause and one variable. 

A rule-observation pair V{^) = (P''i'^(^>),P°>^'^(^)) is defined such that: (i) P°bs(^^) contains exactly 
the fact Xi for each variable Xi in <P, and the facts sat and disabled; (ii) P'"'®(<P) is 



Clearly, 'P{'^) is stratified and can be built in polynomial time. Now it is shown that <P is satisfiable <;=^ 
there is an outher in V{^). 

(=^) Suppose that $ is satisfiable, and take one of its satisfying truth assignments, say a'^, for the vari- 
ables xi,...,Xn. Let X be the set Lit(cr^)~, and X' be a generic subset of X. It is shown that 
O = {disabled} U {X \ X') is an outlier with witness W = {sat} U X' in V{^). 
Indeed, the unique stable model of the program P(7'(<?))w is such that each atom Cj (associated to a 
clause) is false since disabled is true, since it being not removed from P°^®(#). Hence, sat cannot be 




pitj^i), not disabled.' 

p{tj,2), not disabled. > ^1 < j < m, s.t. Cj = tj^i V tj^2 V tj^3 
p{tj^z)i not disabled. ^ 



sat 



ci, ... 
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entailed in P(P(<P))yy; moreover, any atom in X' cannot be entailed in P{V{^))y\; also (because there 
is no rule suitable for the entailment) and, therefore, condition (1) in Definition 1 is satisfied. Consider, 
now, the set O = {disabled} U {X \ X'). It is easy to see that an atom associated with a variable, say 
Xi, is false in P{V{(P))v\!,o if and only if xi G X. Thus, -P('P(<?))w'.c> has the effect of evaluating the 
truth value of the assignment a^. Given that cj'^ is a satisfying assignment, the unique stable model of 
P{V{'P))y\>^o contains sat, thereby satisfying condition (2) in Definition 1. Hence, O is an outlier in 
V{^), and W is a witness for it. 

Suppose that there is an outlier O in Vi'P), and let W be its associated witness set. Notice that in 
order to satisfy condition (2) in Definition 1, W must contain at least a fact that can be eventually 
entailed by P(<^)>v,c'. Clearly, the only fact satisfying this requirement among those in P'''^^^) IS 
sat. Therefore, it must be the case that P(#)>v,o |= sat and, consequently, disabled is in W U O, 
and hence it is false in P(^)w,C'- Then, given that whenever disabled is false the rule component 
evaluates the truth value of formula it is the case that there is a satisfying assignment such that 
(W U O) n {xi, . . . , x„} = Lit(fT^)-. 

Before closing the proof, note that in the '=^'-part above, the task of guessing a truth assignment is 
"shared" by the two sets W and O. Moreover, the existence of an outlier implies ('<J='-part) that <P is 
satisfiable and, eventually (from the '=^'-part), that there is an outlier O having the form {disabled} U 
{X \ X'). Thus, the following stronger claim also holds: ^ is satisfiable <^ there is an outlier in 'P{^) 44> 
O = {disabled} U {X \ X') is an outher with witness W = {sat} U X' in P(<^), where X = Lit(cr^)" 
with (j^ being a truth assignment for the variables xi,...,Xn that makes <P true, and X' is a generic subset 
oiX. □ 

The complexity of the EXISTENCE problem in the most general setting is studied next. The following 
theorem shows that, under the brave semantics, the problem for general programs lies one level up in the 
polynomial hierarchy w.r.t. the complexity associated with stratified programs. 

Theorem 3. EXISTENCE under the brave semantics is E2 -complete. 

Proof. (Membership) Given a rule-observation pair V = {P'^^^, P°^^), it must be shown that there are two 
sets W,0 C P°bs g^^jj jjj^j P{V)yv 1=6 (query q') and P{V)w,o (query q"). Query q' is 

NP-complete, while query q" is co-NP-complete (see, e.g., [24]). Thus, a polynomial-time nondeterministic 
Turing machine can be built with a NP oracle, solving query EXISTENCE as follows: the machine guesses 
both the sets W and O and then solves queries q' and q" by two calls to the oracle. 

(Hardness) Let ^ = 3XVY/ be a quantified Boolean formula in disjunctive normal form, i.e., / is a 
Boolean formula of the form di V . . . V djn, over the variables X = xi, . . . x„, and Y = yi, . . .yq. Deciding 
the validity of these formulas is a well-known Z'|'-complete problem. Without loss of generality, assume that 
each disjunct dj contains three literals at most. With <P the rule-observation pair 'P(#) = {P'^^^{<P), P°^^{<P)) 
is associated such that: (i) P°^^^{^) contains exactly a fact Xi for each variable Xi in and the facts sat and 
disabled; (ii) P'^{^) is 

yi not bi- 1 < i < 9 

bi not yi- i < i < q 

sat <— p{tj,i), p{tj^2), pitj,3), not disabled. ^ < j < m s.t. dj = tj^i A tj^2 A tj^3 
Clearly, 'P(<P) can be built in polynomial time. Now it is shown that <P is valid there is an outUer in V{^). 
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(^) Suppose that ^ is valid, and let a be a truth value assignment for the existentially quantified vari- 
ables X that makes # valid. Next the sets W and O are built, by exploiting the ideas of the reduc- 
tion in Theorem 2: Let X be the set Lit(o"^) , and X' be a generic subset of X. It is shown that 
O = {disabled] U \ X') is an outlier with witness W = {sat} U X' in P(^). 
Note that, since disabled W, the program P('P(^))w; cannot entail sat (under any semantics); 
moreover, any atom in X' cannot be entailed in P{V{^))y\! as well (because there is no rule suitable 
for the entailment) and, therefore, condition (1) in Definition 1 is satisfied. As for condition (2) in 
Definition 1, note that the stable models of the program P{V(<P))yv,o are in one-to-one correspondence 
with the truth assigimients of the universally quantified variables. In particular, given that a-^ makes 
the formula valid, each stable modelhas the form {sat}ULit(cT^)^ULit((T^)^U{6j | yi G Lit((T^) }. 
Hence, it holds that -P(P(^))w,o V^h -^sat. 

(<^) Suppose that there is an outlier O with witness W in V{^). As sat is the unique fact in P°^^{([>) that 
can be derived by P('P(<P))>v,o, then in order to satisfy condition (2) of Definition 1, it is the case 
that W contains sat. Furthermore, in order to satisfy condition (1) of Definition 1, sat must be not 
entailed in P{V{^))y\). To this aim, either disabled is in W or, letting X = W {xi, . . . , for any 
truth-value assignment ct([xi xn}\x)yj^x^ there is no assignment for the universally quantified variables 
satisfying the formula. Finally, in order to have P{V{<P))y^^o V^b ^sat, the set O is such that, letting 
X' ={p U W) n {xi, . . . , Xn}, then cF(^{xi,...,xn}\x')\j^x' is a truth value assignment for the existentially 
quantified variables X that makes # vaUd. □ 

Note that even though outUer detection on general logic programs turned out to be "intrinsically" more 
complex than detection on stratified logic programs, the sources of the difficulties remain unchanged under 
brave semantics. In fact, the reduction in the above theorem points out that the task of guessing a satisfying 
truth assignment is still "shared" by the two sets W and O, in the same way as the reduction in Theorem 2. 

Next cautious semantics is considered. Whereas, for most reasoning tasks, switching from brave to 
cautious reasoning usually implies the complexity to "switch" accordingly from a certain class C to the 
complementary class co-C, this is not the case for our EXISTENCE problem. 

Theorem 4. EXISTENCE under the cautious semantics is E2 -complete. 

Proof. (Membership) Given a rule-observation pair V = {P^^^, P°^^), it must be shown that there are two 
sets W, O C such that P('P)w (query q') and P{V)w,o (query q"). Query q' is 

co-NP-complete, while query q" is NP-complete (see, e.g., [24]). Thus, a polynomial-time nondeterministic 
Turing machine with a NP oracle can be built, solving query EXISTENCE as follows: the machine guesses 
both the sets W and O and then solves queries q' and q" by two calls to the oracle. 

(Hardness) Let ^ = 3XVY/ be a quantified Boolean formula in disjunctive normal form, i.e., / is 
a Boolean formula of the form Di V . . . V Dm, over the variables X = xi, . . . and Y = yi, . . .yq. 
We associate with # the rule-observation pair = (P'^'^(<P), P°^^(#)) such that: (i) P°^^{^) contains 
exactly a fact Xi for each variable Xi in <P, and the facts unsat and disabled; (ii) P''^^(#) is 

yi not bi- i < i < Q 

bi ^ not yi. 1 < « < 9 

sat <- p{tj^i),p{tj^2),p{tj,3)- ^ <j <m s.t. Dj = tj^i A tj^2 A ij,3 
unsat <— not sat. 
unsat <— not disabled. 

Clearly, V{^) can be built in polynomial time. Now it is shown that <P is valid <^ there is an outUer in 
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(^) Suppose that is valid, and let a be a truth value assignment for the existentially quantified variables 
X that makes # valid. Consider the set W composed by the fact unsat plus all the facts associated to the 
variables that are false in cr^, that is the set {unsat} U Lit (cr''^) , and consider the set O composed only 
by the fact disabled. Note that, the stable models of the program P('P((p))w are in one-to-one corre- 
spondence with the truth assignments cry of the universally quantified variables. In particular, the unique 
stable model My is given by the set lAi{a'^)^ U \-i\t{a^)^ U {hi \ yi € Lit((7-*^) } U {sat, disabled}. 
Indeed, since the formula is satisfied by cr^, for each My, sat G My and unsat ^ My. Hence, 
P('P(#))w \=c ~'W. Conversely, the program P{V{^))w,o in which disabledis false, trivially derives 
unsat. It can be concluded that O is an outlier in and W is a witness for it. 

(<^) Suppose that there is an outlier O with witness W in V{^). As unsat is the unique fact in P°bs(^^ 
that can be derived by P(P(#))vv; o, then in order to satisfy condition (2) of Definition 1, it is the case 
that W contains unsat and that P(P(^))w \=c ^unsat. Furthermore, in order to satisfy condition (1) 
of Definition 1, disabled does not belong to W. Thus, {unsat} C W C {unsat, xi, . . . , Xn}- Let X 
be the subset W \ {unsat} and let a-^ be the truth value assignment cr(^^^-^^^ .^^j\x)u^x to the set of 
variables X. Clearly, P{V{0))yv \=c {{xi, ...,Xn}\X)U ^X. As P(P(#j')w -^unsat, then it 
is the case that for each subset Y of Y, the stable model My of P{V{(P))w associated with Y, that is 
the model My containing Y and no other fact from Y, is such that sat G My. That is, for each truth 
value assignment to the variables in the set Y, there is at least a disjunct such that a-^ o makes 
the formula / true. As a consequence, <P is vaUd. To conclude the proof, note that O = {disabled} is 
always an outUer having such a witness. □ 

It should be pointed out in conclusion that even though the complexity of the EXISTENCE problem turned 
out to be the same for both brave and cautious semantics, the nature of the problems are still quite different. 
In fact, the proof of the above theorem shows that under cautious semantics the witness is alone responsi- 
ble for guessing the whole satisfying assignment. Conversely, it has already been noticed that under brave 
semantics outher detection requires the efforts of determining both the witness and the outliers that, in fact, 
both contribute to the task of guessing the satisfying assignment - see proof of Theorem 3. Intuitively, the 
reason for this behavior under cautious semantics lies in the fact that the condition (2) of Definition 1, i.e., 
P{V{^))y\;^o VV, just amounts at identifying a model of P(P(^))w;,c) that does not entail an element 
of W. Hence, outUers act just as "switches" under cautious semantics, for they solely prevent the entailment 
of any element of W. 

The consequences of this difference will become more evident in the next section, while discussing the 
complexity of other outlier detection problems. Intuitively, expect that each time the witness set is given or 
has its size fixed, then the outUer detection problem will have the same complexity as the general case under 
brave semantics, but will become easier under cautious semantics. 

3.3 Computational Complexity of Outlier Checking Problems 

In this section the complexity of some further problems related to outlier identification is studied. Specifi- 
cally, given a rule-observation pair V = {P^^^^, P°^^), the following problems will be considered: 

• OUTLIER- CHECKING: given O C P°^\ is O an outher for some witness set W? 

• WITNESS -CHECKING problem: given W Q P°^\ is W a witness for some outher OinPl 

• OW- CHECKING: given O, W C P°^\ is O an outlier in V with witness W? 

The following theorem states the complexity of the first of the problems listed above. 
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Theorem 5. Let V = (P'^'^ P°^^) be a rule-observation pair Then, OUTLIER- CHECKING is 

1. NP-complete, for stratified LPs, and 

2. U2 '(complete ( under both brave and cautious semantics) for general LPs. 
Proof. 

1. As for the membership, given O C pobs^ ^et us guess a set W and verify that it is an outlier witness in V 
for O. To this aim it must be verified that conditions P{V)v\> \= and P('P)w,o hold. These 
tasks are feasible in polynomial time since P(P) is stratified and hence it has a unique stable model that 
can be computed in polynomial time. As for the hardness, the same line of reasoning as the proof of 
Theorem 2 is exploited, in which given a formula ^ rule-observation pair 'P(#) = P°'''^(^)) 
has been built. It follows immediately from the proof of Theorem 2 (by letting X' = X) that O = 
{disahled^ is an outher <J4> the formula ^ is satisfiable. 

2. As for the membership, given O C p°bs^ guess a set W and verify that it is an outlier witness in 
V for O. These latter tasks amount to solving an NP and a co-NP problem, as seen in Theorems 3 and 4. 
As for the hardness, the same reduction as Theorems 3 and 4 is exploited, in which given a formula # a 
rule-observation pair P(^) = (P'''**(^), P°^'*(^)) has been built. It follows immediately from the proofs 
of Theorems 3 and 4 that O = {disabled} is an outlier <;=^ the formula # is satisfiable. Specifically, to 
see why this is the case for the proof of Theorem 3, it is sufficient to let X' = X in the '^'-part. □ 

Next the complexity of the WITNESS— CHECKING problem is studied. Interestingly, since 
WITNESS — CHECKING assumes the witness to be provided in the input, its complexity is affected by the 
chosen semantics, as briefly outlined in the previous section. 

Theorem 6. Let V = (P^^^ P°^^) be a rule-observation pair. Then, WITNESS -CHECKING is 
L NP-complete, for stratified P'''**, 

2. IJ2 -complete under brave semantics for general P'^^, and 

3. -complete under cautious semantics for general P'^^. 

Proof. 

1. As for the membership, given W C p°bs^ guess a set O and check that it is an outlier in V with 
witness W. To this aim it must be verified that conditions P('P)v\; |= -^W and P(P)w,o ^ -iW hold. 
Since P(P) is stratified this check is feasible in polynomial time. The same reduction as Theorem 2 can 
be exploited, in which given a formula <P a rule-observation pair P(<?) = (P'"'^(<?), P°'^'^(<^)) has been 
built. It follows immediately from the proof of Theorem 2 (by letting X' = 0) that W = {sat} is a 
witness for an outlier in V{^) if and only if the formula # is satisfiable. 

2. As for the membership, given W C p"^^^ let us guess a set O and verify that it is an outlier in V with 
witness W. These latter tasks amount to solving an NP and a co-NP problem, as seen in Theorem 3. 
As for the hardness,the same reduction of Theorem 3 can be exploited, in which given a formula a 
rule-observation pair V{^) = {P'^^{^),P°^^{^)) has been built. It follows immediately from the proof 
of Theorem 3 (by letting X' = 0) that W = {sat} a witness for an outlier in the formula # is 
satisfiable. 

3. Both conditions P((P)y^ \=c -<W and P{T)w,o ~'W have to hold for a set O. The former condition 
can be checked in co-NP, whereas the latter amounts to guessing both an outlier O and a model for 
P(7')w,0' which is feasible in NP. 
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As for the hardness, let (j) = A ... A be a boolean formula on the set of variables xi , . . . , 
^'k ~ 1 ^ 2 ^ 3 (-'^ — ^ — ^^'^ (t>" = c'/ A ... A Cg be a boolean formula on the set 
of variables yi, . . . , y^, c'^ = ^ V 2 ^ 3 (1 < ^ < s). Without loss of generality, assume that 
the sets xi, . . . , a:„ and yi, . . . , have no variables in common. A rule-observation pair 'P((/>', (j)") = 
(/)"), ((/>', 0")) is defined such that (i) P((/>', 0")°^' is the set {sai, o), and (ii) 

is 

Xi <— not aj, o. 1 <i <n 

ai not Xj, o. 1 <i <n 

Cfe^P(*fe2)'0- l<A;<r 
4^P(<3)'«- l<A;<r 
yj ^ not hj , not o. 1 < j < m 
bj ^ not Uj , not o. 1 < j <m 
c'h^ p{th^i),noto. l<h<s 

^ Pi^h 2)' '^'^^ ^- i < h < S 

^ P(^h 3)' ""^^ ^- i < h < S 
SQjtj ^ ^1 ' ' * * ' • 
SQit ^ Cj^ 5 . . . J Cg . 

Now it is shown that (f)' is unsatisfiable and (p" is satisfiable there exists an outlier in P{4>', (p") with 
witness W = {sat}. 

(^) Assume that (j)' is unsatisfiable and 0" is satisfiable. It is shown that {0} is an outlier with witness 
{sat}. Indeed, given that o is true in V{(j)', 4'"){sat} given the encoding of this program, there 
is no model M' £ SJV\.{P{(t)' ,(()") {sat}) such that sat G M' , because (j)' is not satisfiable. Thus, 
P((/>', (j)"){sai} \=c ^ sat holds and condition (1) in Definition 1 is satisfied. Similarly, given that 
o is false in V{(t)' ,(j)"){gat},{o} given the encoding of the program, there is a model M" G 
SM{V{(I)', (t)"){sat},{o}) such that .sat G M" , because cj)" is satisfiable. Thus, 0"){sat},{o} V=c 
-1 sat holds, and condition (2) in Definition 1 is satisfied as well. 

(=^) Assume that there is an outlier O in cp") with witness W = {sat}. It is shown that (j)' is 
unsatisfiable and ^" is satisfiable. Indeed, notice beforehand that the only set candidate to be an 
outlier is {o}, because sat and o are the only observations at hand. Then, because of Definition 1: 
(1) there is no model M' G 5A1(P(^', 0"){sat}) such that sat G M' , and (2) there is a model 
M" G SM{V{(t)' , <p"){sat},{o}) such that sat G M" . Given the encoding of the programs above and 
the fact that o is true in V{'^' , (t>"){sat} while it is false in V{(t>' , 0"){sot},{o}' we conclude from (1) 
that (j)' is not satisfiable, and from (2) that 0" is satisfiable. □ 

Finally, the next theorem provides the complexity for the OW— CHECKING problem, in which it is simply 

checked whether two given sets O and W are an outlier and a witness for it, respectively. Notice that problem 
OW— CHECKING is relevant as it may constitute the basic operator to be implemented in a system of outlier 
detection. Interestingly, in this case, the complexity does not depend on the semantics. 

Theorem 7. Let V = (P'''^ P"*"^) be a rule-observation pain Then, OW-CHECKING is 

1. V-complete, for stratified P^^^, and 

2. T>^ -complete (under both brave and cautious semantics) for general P'^^^. 
Proof. 
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1. As for the membership, given W,0 C pot>s^ ^et us check in polynomial time that P(P)w |= 

and that P{'P)w,o ^ As for the hardness, given a stratified logic program P and an atom a, the 
P-complete problem of deciding whether P |= a is reduced to OW— CHECKING. Consider the rule- 
observation pair V with P'^'' = PU{a ^ not b} and = {a, b}, where b is a new propositional 
letter not occurring in P. It is shown that P \= ~'Ci <^ {o,} is an outlier witness for {6} in P. 

(^) Assume that P \= -la. Then, {a} is an outlier witness for {6} in V. Indeed, consider the program 
P{V){a}, and notice that a cannot be entailed because of the assumption and of the fact that the 
body of the rule a ^ not b is false because b is true in P{V)^a}- Thus, condition (1) in Definition 1 
holds. As for condition (2), consider the program P{V){a},{b} notice that it entails a, precisely 
because of the rule a ^ not b. 

(<=) Assume that {a} is an outlier witness for {6} in V. To conclude that P \= -la, it is sufficient to 
consider condition (1) in Definition 1. Indeed, it must be the case that P{V){a} does not entail a. 
Given that the rule a ^ not b cannot be used for entailing a (because b is true in P{'P){a})' it can 
be concluded that a cannot be entailed by P. 

2. As for the membership, given W, O C P°b^ conditions P{V)yv H and P{'P)w,o ^ can 
be checked respectively in co-NP and NP under cautious semantics, and respectively in NP and co-NP 
under brave semantics, hence in both cases the conjunction of a NP and a co-NP problem has to be 
answered, that is a problem. As for the hardness the proof is analogous to Point 3 of Theorem 6, by 
letting W = {sat} and O = {o}. □ 

Up to this point, attention has been focused on outlier decision problems. Turning to outlier computation 
problems, the following result can be established by noticing that in the proofs of the EXISTENCE problem 
(Theorems 2, 3 and 4), solving a satisfiability problem is reduced to computing an outher. 

Corollary 1. Let V = P^^"^) be a rule-observation pair Then, the COMPUTATION problem, i.e., com- 
puting an arbitrary outlier in V, is (i) FNP-complete, for stratified rule components, and ( ii) -complete, 
for general rule components. 

3.4 Data Complexity of Outlier Problems 

In all the complexity results derived so far, a setting has been considered in which both the rule component 
and the observation component are part of the outlier detection problem input. In order to have a more 
complete picture, in this section the data complexity of these problems is investigated. That is, a fixed rule 
component is considered and a set of observations are, instead, provided as the input. Such a kind of analysis 
may be useful in the context of database applications, where one is usually interested in understanding how 
the complexity of a problem varies as a function of the database size (cf. [84]). In this respect, the analysis 
becomes more interesting if also non-ground programs are considered. Therefore, in the following, both the 
ground and non-ground settings are considered. 

Ground programs. Some further computational complexity notions are now recalled. Let C be a boolean 
circuit. The size of C is the total number of its gates. The depth of C is the number of gates in the longest 
path from any input to any output in C. A family {Cj} of boolean circuits, where Ci accepts strings of size 
i, is uniform if there exists a Turing machine T that, on input i, produces the circuit Cj. {Ci} is logspace 
uniform if T carries out its task using 0(log i) space. Then, AC° is the class of decision problems solved 
by logspace uniform families of circuits of polynomial size and constant depth, with AND, OR, and NOT 
gates of unbounded fan-in. 
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It is not difficult to see that, liandling ground programs, all the basic outlier detection problems have 
their data complexity lying in AC°. Indeed, let A denote the set of propositional letters occurring in the rule 
component. Under the data complexity measure, the rule component is fixed, while the input of the problem 
consists of the observation component. Nonetheless, it can be shown that the complexity of outlier detection 
problems on ground programs is independent of the observation component out of letters in A. Indeed, being 
the rule component fixed, if O is an outlier with associated witness W, then n A is an outlier as well with 
associated witness WD A. Thus, given a pair T = {P^^^, P°^^), it holds that there is an outher in 'P iff there 
IS an outher in Ve = {P'^^ P"*"" n A). Once the pair Ve is available, conditions 1 and 2 of Definition 1 can 
be tested using a constant amount of time as the number of outlier/witness pairs in Vg is upper bounded by 
2^" and the number of models of Ve is upper bounded by 2", where n denotes the number of letters in the 
set A, which is a constant. Both the latter and the former tasks can be solved in AC°. Thus, the following 
theorem holds. 

Theorem S. LetV = (P"^^', P°^^) be a rule-observation pair such that P is a fixed propositional logic 
theory. Then, EXISTENCE, OUTLIER- CHECKING, WITNESS -CHECKING, and OW- CHECKING are in AC°. 

As a consequence, outlier detection problems under the data complexity measure can be solved in poly- 
nomial time and are highly-parallehzable. 

Non-ground programs. The investigation of the data complexity for this setting can be carried out, by 
preliminary putting into evidence the features it shares with the ground case when both rule and observation 
components are part of the input. The basic idea is that the reductions used in proving complexity results for 
outlier detection problems (carried out via encodings into the rule component which is, therefore, required to 
be part of the input) can be "simulated" by some fixed non-ground encoding. In particular, the instantiation 
of such non-ground encodings can be made in such a way to simulate the encodings exploited in the case 
of ground programs. Let us consider, for instance, the data complexity for problem EXISTENCE under the 
cautious semantics. 

Theorem 9. Let V = {P^^^, P"*^**) be a rule-observation pair such that P'^^ is a fixed general logic program. 
Then EXISTENCE under the cautious semantics is E2 -complete. 

Proof. (Membership) Given a fixed general logic program P^^^ and a set of ground facts p°^^, it must 
be shown that there are two disjoint sets W, O C P°^^ such that P(P)w |= -iVV (query q') and 
P{T^)w,0 (query q"). Recall that the complexity of the entailment problem for general proposi- 

tional logic programs is co-NP-complete. Thus, a polynomial-time nondeterministic Turing machine with 
an NP oracle can be built solving EXISTENCE as follows: the machine guesses both the sets W and O, com- 
putes the propositional logic programs ground{P{V)vs>) and ground{P{V)\yp) - this task can be done in 
polynomial time since the size of these programs is polynomially related to the size of P°^^ and then solves 
queries q' and q" by two calls to the oracle. 

(Hardness) Let <P = 3XVY/ be a quantified Boolean formula in disjunctive normal form, i.e., / is a 
Boolean formula of the form di V . . . V dm, over the variables X = xi, . . . Xn, and Y = j/i, . . . y^. With 
the following set of facts P°^^ (<P) is associated: 

01 : unsat. 

02 : disabled. 

03,fe : variable3{xk) ■ 1 < k < n 

04,i : variableViyi, y(i+i) mod (g+i))- 1 < « < 9 

06,3 : disjunct{dj,d(j+i) ^od im+i),p{tj,i),i{tj,i), p{tj,2),i{tj,2), p{tj,3),i{tj,3))- i<j<m 
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where dj = tj^i A tj^2 A ij,3, I < j < m, £{t) denotes the atom occurring in the literal t, and p{t) is the 
constant pos, if t is a positive literal, and the constant neg, if t is a negative literal. Intuitively, the atoms 
03,*;, 04 j, and 05 together provide an encoding of the formula Such an encoding will be exploited by 
the subsequent rule part (see below) in order to evaluate the truth value of the formula cp. In particular, each 
atom os^fc is associated to a distinct existentially quantified variables, each atom 04,^ is associated to a distinct 
universally quantified variable, while each atom 05J is associated to a distinct disjunct dj occurring in to the 
formula As for the atoms unsat and disabled, they have the same role played in Theorem 4. 
The rule part of the pair is composed by the following fixed general logic program P'^^: 

' ro : disjunctTrue <— disjunct{., _, pos, Xi , pos, X2 , pos, X3) , 

variable3{Xi),variahle3{X2),variable3{X3). 
ri : disjunctTrue <— disjunct{-, _, neg, Xi , pos, X2 , pos, X3 ) , 

not variable3{Xi),variable3{X2),variable3{X3). 

r-j : disjunctTrue <— disjunct^., _, neg,Xi,neg,X2,neg,Xz), 

not variable3{Xi), not variable3{X2),not variable3{X3). 

rs : disjunctTrue ^ disjunct{., .,pos,Yi,pos,X2,pos,X3), 

variableVTrue{Yi),variable3{X2), variable3{X3). 

res : disjunctTrue <— disjunct{., _, neg, Vi, neg, Y2, neg, Y3), 

not variableiTrue{Y\) ,not variableWTrue{Y2),not variableiTrue{Y3) . 

{re4 : variableyTrue{Y) <— variabley{Y,^,not variable\/False{Y). 
res '■ variable^ FalseiY) <— variable\/{Y, S) , not variable\/True(Y). 

' ree : disjunct(dQ,di,pos,xo,pos,xo,pos,xo)- 
r67 : variable3{x()) . 
res '■ variabley{yo,y\). 

reg : unsound <— disjunct{Cl, C2, _, _, _, _, _, _), not disjunctIN{C2). 
r7() : disjuncti N(C2) ^ disjunct{C2, S). 
r7i : unsound ^ variable^ {Y1,Y2), not variable'i I N{Y2). 
r72 : variableWIN{Y2) ^ variable'iIN(Y2,.). 
_ r73 : sound <— not unsound. 

r74 : sat <— sound, disjunctTrue. 
rrs : unsat <— not sat. 
rre : unsat <— not disabled. 

Next, some comments on the rules composing the program are provided. 

Rules ro . . . rga are introduced in order to compute the truth value of the disjuncts composing the formula 
(p. Indeed, in the body of each of these rules, there is exactly one atom o^j, encoding a generic disjunct dj in 
^, and three atoms evaluating the truth value of the three literals occurring in to the disjunct. Notice that, 64 
rules are needed in order to represent all the possible schemes of disjuncts occurring into a # formula, that 
is, all the possible conjunctions of three boolean variables, either negated or positive, of two distinct types, 
i.e., either existentially or universally quantified. 

Rules rg4 and rgs serve the purpose to guess a possible truth value assignment to the universally quanti- 
fied variables in (p. Intuitively, being the ground atom variable\/True{yi) {variables/ False{yi) resp.) true 
in a model of the overall program, means the corresponding universally quantified variable yi is intended to 
be true (false resp.) in a suitable truth value assignment to the variables of # which is encoded by the model. 
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Rules rgg . . . rya prevent that the atoms o^^i and 05 j of the observation part, associated respectively to the 
universally quantified variables t/i and to the disjuncts dj of #, belong to some outlier or witness of the over- 
all pair. Indeed, if some of these atoms is removed from the observation component, then the rule component 
will not evaluate cp correctly. To this aim, the rule ryi (rgg resp.) will imply the atom unsound whenever 
an atom 04,2 (05J resp.) is present in the model while the subsequent atom 04,2+1 (05^+1 resp.) is not. This 
check can be accompUshed since each atom 04,^ {05 j resp.) carry both the constant yi (dj resp.) associated 
to the variable yi (the disjunct dj resp.) and the constant yi+i (dj+i resp.) associated to the subsequent vari- 
able yj+i (disjunct dj+i resp.). Thus, loosely speaking, universally quantified variables and disjuncts of the 
formula are chained together so that either none of them or all together can be removed without entailing 
unsound. In order to prevent the latter possibility, a dummy disjunct (see rule ree), which evaluates always 
true since it contains the dummy existentially quantified variable xq which is always assumed to be true in 
its turn (see rule rgy), and a dummy universally quantified variable yo (see rule rgs) are introduced in the 
rule part, referring respectively to the first disjunct di and to the first universally quantified variable yi of 
Since both ree and rgg cannot be removed for sure from the overall pair, then they guarantee that unsound 
is true in a model of the overall program if and only if at least an observation 04^1 or o^j is removed from 
the observations. 

Finally, rules r^. . . rye have the same purpose of the corresponding rules in the reduction described 
in Theorem 4, the only difference being now that in order to entail sat the rule r74 requires the formula is 
syntactically sound other than evaluating true. 

It is now clear that the rest of the theorem will follow a line of reasoning analogous to that of Theorem 4. 
For completeness, we continue completing the formal proof. 

Now it is shown that <P is vaUd there is an outlier in V{^) = {P^^^, P°^^). 

(^) Suppose that ^ is vahd, and let be a truth value assignment for the existentially quantified variables 
X that satisfies /. Consider the set W composed by the fact unsat plus all the facts variable3{xi) 
associated to the variables that are false in , that is the set {un8at}U{variable3{x) \ x G Lit((7^) }, 
and consider the set O composed only by the fact disabled. Note that the stable models M.y of the 
program P[V{<P))y\) are in one-to-one correspondence with the truth assignments ay of the universally 
quantified variables (consider rules tq^ and res). Now, since the formula is satisfied by cr^, for each My, 
sat € My and unsat ^ My. Hence, P(V{^))w \=c "'W. Conversely, the program P{'P{(P))w,o in 
which disabled is false, trivially derives unsat. It can be concluded that O is an outher in P{^), and W 
is a witness for it. 

(<^) Suppose that there is an outher O with witness W in V{^). As unsat is the unique fact in P°^^(<P) 
that can be derived by P{V{^))w,0^ then in order to satisfy condition (2) of Definition 1, it is the case 
that yy contains unsat. Furthermore, in order to satisfy condition (1) of Definition 1, disabled does not 
belong to W. Now, it is shown that {unsat} C W C {unsat, variable3(xi) , . . . ,variable3{xn)}. 
Since unsat must be false in P(#)v\;, because of condition (1) in Definition 1, it can be observed that 
P(<P)w entails sat and, thus, by rules rys and r74, it does not entail unsound. Assume now, for the 
sake of contradiction, that a fact 04 j (05 resp.) belongs to W. Then, program P(cp)w entails unsound, 
which is impossible — see the general comments about the reduction reported above. 
Let X be the subset {variable3{x) \ x G (W \ {unsat})} and let be the 
truth value assignment cri^{x-i_,...,xn}\x)yj^x to the set of variables X. Clearly, P(7^(#))>v \=c 
{{variable3{xi), . . . ,variable3{xn)} \ X) U -lAf. Furthermore, as P(P(#))w \=c -'unsat, then it 
is the case that for each subset Y of Y, the stable model My of P('P(#))w associated with Y, that 
is the model My containing {variable'iTrue{y) \y & Y} and no other fact of the same predicate, is 
such that sat G My. That is, for each truth value assignment to the variables in the set Y, there is at 
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least a disjunct such that a o a makes the formula / true. As a consequence, <P is valid. To conclude 
the proof, note that O = {disabled} is always an outlier having such a witness. □ 

Theorem 9 depicts a strategy to reformulate the reduction illustrated in previous Theorem 4 in terms of 
a reduction exploiting a non-ground rule-observation pair whose rule component is kept fixed. Notably, 
this strategy can be also adopted for all the other reductions exploited in the proofs presented so far. For 
instance, we leave to the careful reader to check that, by adapting the same line of reasoning to Theorems 
2 and 3, the following results concerning outher detection problems under the data complexity measure can 
be eventually obtained. 

Theorem 10. Let V = ^p^is pobs^ ^ rule-observation pair such that P^^^ is a fixed stratified logic 
program. Then EXISTENCE is NP-complete. 

Theorem 11. Let V = (P"^'^, P°^^) be a rule-observation pair such that P^^^ is a fixed general logic pro- 
gram. Then EXISTENCE under the brave semantics is U2 -complete. 

4 Minimum-size Outlier Detection 
4.1 Extending the Framework 

There are several practical situations in which the computation of just any outlier is not what is really sought. 
Consider, for instance, the short story in the Introduction, and recall that Nino is considered to be an outlier 
by his Finnish colleagues because he is the only person with black hair and brown eyes among the other 
people working for the team. This assumption appears quite reasonable, given our intuition of outliers being 
individuals whose behavior is deviant w.r.t. the "normal" one. However, it is not the only possible one; 
indeed, by changing the roles played by Nino and the other team members, it might be concluded that all the 
team members but Nino are in fact outliers and that Nino having black hair and brown eyes is the associated 
witness. If asked to assess the correctness of the latter conclusion, most of us would certain disagree with it, 
because we implicitly associate the notion of normality with the characteristics embodied by the majority of 
the observations at hand, and for we are inclined to label an individual as anomalous precisely because it is 
rare in the observations. 

Therefore, it is sometimes natural to constrain the basic notion of outliers, formally defined in Def- 
inition 1, in order to account for some criteria aiming at singling out outliers of minimum size. In this 
section, the outlier detection problem will be studied with the additional constraint of minimizing the out- 
lier size. It is worthwhile noting that this setting differs with what is generally required by the principle 
of minimal diagnosis [76], where minimahty according to set inclusion rather that to cardinaUty is of- 
ten considered. In fact, the inclusion based minimality criterion does not generally prevent the possibil- 
ity of having outliers involving the majority of the observations at hand as, e.g., in the above reported 
example. As a further (extreme) example, consider the rule-observation pair Vi = (P['^, Pf'^^), where 
P[^^ = {a not oi, ...,not On- b <— not o.} and P°^^ = {oi, On, o}. Then, there are two minimal 
outliers in V\\ {oi,...,Oii} whose associated witness is {a}, and {0} whose associated witness is {b}. 
Clearly, singling out the former outlier may be undesirable in several situations, since it includes all but 
one observation and therefore fails in conveying information about abnormality with respect to the observed 
population; the latter outher, instead, seems to better reflect the intuition beyond outher detection problems. 

A first natural problem that arises in this setting is to decide about the existence of outliers of bounded 
size. For instance, it may be interesting to decide whether there are outhers consisting at most the 5% of 
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the observations or whether there are outUers consisting of one individual only. Actually, it is next shown 
that bounding the size of the outUers we are looking for does not affect the complexity of the EXISTENCE 
problem. 

Theorem 12. Given in input a rule-observation pair V = (P"^^^, P°^^), and a natural number k, the 
EXISTENCE[k] problem of deciding the existence of outlier O of size at most k (\0\ < k) in V is 

1. NP-complete, for stratified P^^, and 

2. E^, for general P'^. 

Proof For the membership it is sufficient to observe that the membership parts in Theorems 2, 3 and 4 can 
be modified by verifying that the guessed outlier O has size at most k. Such a test is feasible in polynomial 
time, and hence does not affect the complexity of the algorithms. As for the hardness it is sufficient to 
observe that in the proof of the theorems above, if {disabled} is an outlier, then the formula is satisfied. □ 

Similarly, an analogous version of the problem WITNESS — CHECKING can be formalized: given W C 
pobs ^ fixed natural number fe, is W a witness for any outlier O in V, such that \0\ < kl This problem 
will be called WITNESS-CHECKING[k]. 

Interestingly, this time bounding the size of outlier indeed influences the complexity associated with 
the problem. In fact, for general LPs it becomes D^-complete even under brave semantics (rather than 
complete), and for stratified LPs it becomes even feasible in polynomial time. This is shown in the following 
theorem. 

Theorem 13. Let V = P"""^) be a rule-observation pair. Then, WITNESS- CHECKING [k] is 

1. P-complete, for stratified P'''^, and 

2. -complete ( under both brave and cautious semantics) for general P"^'®. 

Proof. 1. As for the membership, given W C P°bs^ it has to be verified that, there is O C (P°bs \ yy) 
such that \0\ < k, both P{V)w \= -^W and P(P)w,o hold. Since the number of such outliers is upper 
bounded by |pobs|fe^ hence polynomially time bounded, and since P{V) is stratified, the overall check 
is feasible in polynomial time. As for the hardness, a reduction is exploited to the P-complete problem 
PROPOSITIONAL STRATIFIED LOGIC PROGRAMMESfG, i.e. the problem: given a propositional 
stratified logic program P and an atom a, decide whether P \= -la. Consider the rule-observation pair 
V = {P'^^, P°^^), with P"^'^ = PU{a^^b} and P°^^ = {a, 6}, where b is a propositional letter not 
occurring in P. Clearly, P'^^ is stratified and there is an outUer O of size \0\ < 1 in P having witness 
W = {a} iff P ^ ^a. 

2. Firstly, let us consider brave semantics. Let W be a subset of P°^, and let Oi, . . . ,Om, rn = 
^IPobsy^l^ ^ ^|p°b=\w|^ + . . . + ^ be all the subsets of P°^« \ W having size equal or less than 

k. Notice that deciding checking whether P('P)w \=b -'W, i.e., whether condition (1) in Definitionl 
is satisfied, is feasible in NP since it amounts to guessing a model for P(P)w and at verifying that it 
entails -iW. 

Let us consider, instead, condition (2) subject to the size constraint, i.e., it must be decided the existence 
of a set O, with \0\ < k, such that P{V)w,o ~'W ■ Actually, the complementary condition can 
be faced, denoted by C2, of deciding whether for each set O G {Oi, ... , Om}, there is a model M of 
P{V)w,o such that M entails ^W. 

To this aim, let Pi, 1 < i < m, denote the new logic program obtained from P{V)y\;^Oi by replacing 
each propositional letter p occurring there with a local copy of p, say p^, and let Wi denote the new 
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set obtained from W by substituting to each prepositional letter w there occurring, the local copy w'' 
of w in Pi. Consider the logic program P' = Pi U . . . U and the set W' = Wi U . . . U Wm- We 
note that both P' and W' can be built in polynomial time. Now, it is easy to see that, by construction, 
P' if and only if is satisfied. Thus, given W C P°^\ checking that P(7^)w \=b and 

that P{V)yv,o ~'W amounts to solving an NP and a co-NP problem. As for the hardness, given two 
boolean formulas and (/>', the problem of deciding whether ^ is satisfiable and 0' is unsatisfiable can 
be reduced to WITNESS — CHECKING[k] under brave semantics using a reduction similar to that shown in 
Point 3 of the proof of Theorem 6. 

Finally, the proof for cautious semantics follows the one of Point 3 of Theorem 6. □ 



4.2 Complexity of Computation Problems 

As already done in the context of Section 3.2, next let us concentrate on computation problems. Specifically, 
interest is in the COMPUTATION[min] problem: computing an outlier whose size is the minimum over the 
sizes of all the outliers - by min(V) the minimum size value is denoted. Notice that in the case of no 
outlier, min{V) is undefined. To this aim, the computational complexity of a (still, decision) variant of the 
OW-CHECKING problem is studied, denoted by OW-CHECKING[min], in which the attention is focused on 
checking minimum-size outliers only: given O, W C pobs^ is O an outlier in V, with witness W, such that 
\0\= min{V)l 

Theorem 14. Let V = (P""^^, P"^^) be a rule-observation pair Then, the problem OW-CHECKING[min] is 

1. co-HV -complete, for stratified P^^^, and 

2. -complete ( under both brave and cautious semantics), for general P""^^. 
Proof 



1. (Membership) Given O, W C po^s^ let consider the complementary problem OW— CHECKING[min] of 
deciding whether // is not true that O is an outlier in V, with witness W, such that \0\ = min{V). This 
problem can be solved by building a polynomial-time nondeterministic Turing machine that (i) verifies 
in polynomial time that O is an outlier with witness WvaV (if it is not the case then the machines stops 
replying "yes"), {H) guesses the sets set O' and W' , and verifies in polynomial time that (iiz) O' is an 
outlier with witness W' in V, and {iv) \0'\ < \0\ - notice that both (i) and (iv) have been observed 
to be feasible in polynomial time in the membership part of Theorem 2. Then, OW— CHECKING[min] is 
feasible in NP and, hence, OW-CHECKING[min] is in co-NP. 

(Hardness) Recall that deciding whether a Boolean formula in conjunctive normal form is 
not satisfiable, is an co-NP-complete problem. Consider again the rule-observation pair P{$) = 
(P'^'^(<?), P°'^®(#)) built in the proof of Theorem 2. Then, we build in polynomial time the rule- 
observation pair = (P"is(^),P*°bs(^)) such tiiat: (i) P*°bs(^) = pobs^^) ^ P'°^', where 
P'°b^ = {w, 01,02}, and (ii) P"i«(#) = P'^{^) U P""^" where P""^' is: 

w <— not Ol , not 02 ■ 

01 ^ 02. 

02 ^ Ol. 

It is easy to see that the set O = {oi, 02} is an outlier in 'P*{<P) with witness W = {w}. Moreover, 
min{V* {<P)) = \0\ <^ <P is not satisfiable. Indeed, in Theorem 2 it was shown that ^ is satisfiable ^ 
O' = {disabled} is an outlier in 7^(#). Hence, the result follows by observing that O' is such that 

\o'\ < \o\. 
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2. (Membership) The membership in iT|' derives from the fact that the complementary problem 
OW— CHECKING[min] can be solved in E2 under both brave and cautious semantics. Indeed, a 
polynomial-time nondeterministic Turing machine with an MP oracle can be built that {i) verifies that O 
is an outlier with witness W mV making two calls to the oracle (if it is not the case then the machines 
stops replying "yes"), {ii) guesses the sets set O' and W', and verifies {in) that O' is an outlier with 
witness W' in V making two other calls to the oracle, and {iv) that \0'\ < \0\- notice that both (i) and 
{iv) have been observed to be D^-complete in Theorem 7. Then, OW— CHECKING [min] is feasible in E2 
and, hence, OW- CHECKING [min] is in ill'. 

(Hardness) Let <P be a quantified Boolean formula in disjunctive normal form. Recall that deciding 
whether <P is not valid is a ill'-complete problem. The result follows immediately by exploiting the 
reduction described in the Hardness part in the Point 1 above, the only difference being that the rule- 
observation pairs V{^) to be considered are respectively those described in Theorems 3 and 4. To this 
aim it is sufficient to observe that in the proofs for the two theorems above, it was shown that the formula 
# is satisfiable if and only if there is an outlier of the form {disabled}. □ 

By exploiting Theorem 14, it can be shown an FZ'|' (FI7|' resp.) upper bound to the complexity of 
the problem of computing an outlier having minimum size in stratified (resp., general) pairs. This can be 
done by first guessing any outlier with an associated witness and then verifying that it is indeed minimal. 
But, actually better than this can be achieved by defining a more efficient computation method based on 
identifying the actual value of min{V) and, then, guessing an outlier whose size equals ■min{V). 

The case of stratified rule observation pairs is firstly considered, for which the problem turns out to be 
F/A2'[0(log n)]-complete, where n denotes the size of the rule-observation pair. The result can be derived 
by establishing a one-to-one correspondence between outliers and cliques in a graph - computing the size 
of the maximum clique was, in fact, shown to be FZ\2'[0(log n)] -complete in [57]. Notice that, to this end, 
the proof of Theorem 2 does not help because it exploits a construction from the satisfiability problem 
and, more importantly, because the guess of the true variables in the assignment is shared by the outlier 
and its associated witness, so that for any satisfying assignment there is an exponential number of associated 
outliers. Therefore, the most relevant technical problem to be solved in order to prove the result is to establish 
the cited one-to-one correspondence between outliers and graph cliques, by exploiting a more intricate 
reduction^ than the one used in Theorem 2. 

Theorem 15. Given a rule-observation pair V = (P'^^'^, P°^^), computing the value min{V) (if defined) is 
FZ\2'[0(log n)]-complete, for stratified P^^^. 

Proof. (Membership) Given the pair V, it is preliminary observed that the maximum value of min{V) 
is max = \p°^^\ = 0(17^1). Then, by a binary search on the range [0,max], the value min{V) can be 
computed: at each step of the search, a threshold is given in the range [0,maa;], say k, and it has to be 
decided whether the problem EXISTENCE[k] has some solution. After log max steps at most the procedure 
ends, and the value of min((P) can be returned, if this value is not zero. Since EXISTENCE[k] is feasible in 
NP, it follows that the procedure is feasible in FpNP['^('°e")l = F/df [0(log n)], where n is \V\. 

(Hardness) Given a graph G = {A, E), with A = {1, . . . , n} being a set of nodes and E A x A 
a set of edges, a clique C for G is a set of nodes such that Vi, j € C, there is an edge G E. Recall 
that computing the size of the maximum clique in a graph is FZ\f [0(log n)] [19]. A rule-observation pair 

^ This reduction might indeed also be used to prove Theorem 2. However, its intricacies would have made understanding the 
intrinsic complexity of the basic outlier detection problem much more difficult to grasp. 
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r{G) = {P'^^{G),P°^%G)) is built such that: (i) P'^'^^G) contains the facts , and yi, for each 

1 < i,j < ns.t. 
1 < i < n 
l<i<n 
l<i<n 

1 < i,j < n 

^ < i, j ^ n 
^ < i, j < n 
1 < i, j < n 
1 < i < n 
l<i<n 

Let C be a clique in G. Let us denote by the set {x{^ , . . . , } ^ {Vi \ = in}, where each ii is in 
(resp. out) if and only if the node i is not (resp. is) in C. Next it is shown that the above construction is such 
that (i) for each clique C, the set O'-^ is an outlier with witness W = {yes} in V{G); and {ii) each outlier 
O in r{G) is of the form O = O*^, for some chque C. 

(i) Let C be a clique in G. Consider the set W = {yes}. Then, each fact of the form xf^, a;°"*, and yi is 
true in the program P{V{G))y\>- Hence, due to the rules unsat ^ x*", unsat is true in turn, and 
P{V{G))y\> \= -'yes, i.e., condition (1) in Definition 1 is satisfied. Let us now consider the program 
P(7'(G))w,o, for O = O^. It must be shown that N 

yes. To this aim, the unique 

stable model of P(V{G))yv,o is proven to be M = {yes} U {ok, oki,... , okn} U {x'f', xi^} U 
{xf"*, . . . , .x°"*} U {yi, . . . , y„} \ O^, i.e., M is the unique minimal model of -P(7'(G))vv,o- Actually, 
the structure of P{'P{G))y^ q is the same as the one of P{V{G))y^^o but for the rule rg : yes <— 
not unsat, ok replaced by the rule v'q : yes ^ ok., because of the fact that unsat is false in M. Let us 
first note that M is in fact a model of P{'P{G))y^ q, by exploiting the following arguments: 

- Rules rs, and rs are satisfied by M, as ok and all the facts of the form oki are true in M. 

- Rule Tg is satisfied by M, because yes belongs to M. 

- Rules ry, rs, rg, and rio are satisfied by M, because unsat does not belong to M. 

- Rules rii and ri2 are satisfied by M. Indeed, by construction of the set O'-^, yi O*-^ iff G O'-^ 
and, therefore, yj G M iff x]" G M holds as well. 

- Rule r2 is satisfied by M, because of the construction of M and O'-^ which prevents from xf^ and 
^out being both true at the same time (a node is either in the chque or it is not). 

- Rule ri is satisfied by M. Indeed, it is preliminary noticed that the body of the rule accounts for 
pairs of nodes that are not coimected by means of an edge in E. Then, for each pair of predicates 
xf^ and xj^ being true in M, it is the case that they are connected by means of an edge in E; this 
is because a;°"* and a:°"* are in fact in O'-'' and, therefore, are nodes of the chque C. Thus, unsat 
cannot be entailed by this rule. 

To conclude the proof of (i), it is necessary now to show that M is in fact the minimal 
model of the program P{V{G))y^ q. Let us preliminary notice that the atoms in {x^", . . . U 
{x?"*,...,x°"*} U {yi,...,2/n} \0'^ came as facts in P{V{G))^q and, therefore, they must 



node i 'va A, and the fact yes; {ii) P^^^{G) is 
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be in any model of P{T'{G))^ ^. Then, the result follows by noticing that P{V{G))y^ entails 
{yes} U {ok, ok\ , . . . , okn}- Indeed, contains by construction one element in the set {xf', for 
each Xi. Therefore, it is possible to entail oki, for each 1 <i <n, and ok in turn. Consequently {yes} 
must occur in any model of P{V{G))^ 
(ii) Let O be an outlier for V{G). In order to prove the result, some properties of the encoding of G are 
discussed. 

Property Pi: Any witness W for O is such that P(P(G))w h ok and also that P(P(G))w,c> H ok. 

Proof. For the sake of contradiction, assume that ok is not entailed in P{'P{G))\\>. Then, due to 
rules rs and r4, it must be the case that there is a node i such that both xf^ and a;""* are in W. 
But, if this is the case, whatever the elements in O are, there is no chance of entailing ,t or .t°"* 
in P{V{G))y^^o- Indeed, although stratified logic programs are not monotonic, the monotonicity 
property still holds for predicates in the first stratum and the result follows by noting that, except 
for the predicate yes, all the other predicates he in the first stratum of the stratified logic program 
P{V{G)). It can be concluded that P(P(G))w,o \= ^ok and, hence, P{V{G))w,o cannot entail 
any fact by means of rules whose body contains ok. Given that these rules are those which may lead 
to entail facts in the observation components, this is a violation of condition (2) in Definition 1 . 

Property P2: Any witness W for O is such that P(P(G))vv; |= unsat. 

Proof. After Pi it can be assumed, without loss of generality, that ok is true in P(P(G))w. Before- 
hand notice that if x^" is in W then yi is in W too, otherwise, xf^ would be entailed by rule ri2 in 
P{V{G))y^ thereby violating condition (1) in Definition 1. Symmetrically, if yi is in W then xf^ is 
in W too, because of rule rn. This means that the body of the rules rn and ri2 are always false in 
the program P(V{G))y\;^o- Now, assume for the sake of contradiction that unsat is not entailed by 
P(7^(G))vv;. As unsat belongs to the first stratum of P{V{G)), then it follows that unsat is also 
false in the program P{V{G))y\;p. Then, the only fact among those in P°^^{G) that can be entailed 
in P(P(G))w,c» is yes that, consequently, must belong to W. But this cannot be the case because if 
unsat is false, also P(P(G))w \= yes. Hence, O is not an outlier. 

Property P3: Any witness W for O is composed exactly by the fact yes. 

Proof. Recall that W is such that ok and unsat are true in P{V{G))y^ (property Pi and P2 above). 
Then, assume that any fact of the form x"\ x""*, or yi is in W. In order to satisfy condition (1) in 
Definition 1, it must be the case that all the facts having this form are in W, since any fact x^", x°"*, 
or yi will suffice for entailing all the others. However, in this case ok is false. Contradiction. 

Armed with these properties, it can be concluded that the outlier O must be such that yes is entailed in 
the program P{V{G))y\!^o^ i-e. that both -^unsat and ok are entailed in the same program. To this aim, 
for each node i, either xf' or x°"* is in O (but not together, because of rule r2), and x^" is in O if and 
only if yi is in O (because of rules rn and ri2). Furthermore, in order to have unsat not entailed, it 
must be the case that the set C = {i\ xf^ O} corresponds to a clique, because of the rule ri (unsat is 
entailed as soon as two nodes Xj and Xj are marked in while being not connected by means of an edge). 
The result follows by observing that O with the properties stated above coincides with . 

In order to conclude the proof, simply observe that given a clique C the corresponding outlier is such 
that = n + \{yi \ £i = in}\ = n + n — \G\. Thus, the clique having maximum size is in one-to-one 
correspondence with the outlier having minimum size, and computing miniV) amounts to compute the size 
of the maximum cUque. □ 
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One may ask what is the complexity to be paid to compute the value min{V) for general programs. The 
following theorem partially answers the question, by providing an upper bound for it, while leaving it open 
an exact characterization of its intrinsic complexity. 

Theorem 16. Given a rule-observation pair V = (P^^*^, P°^^), computing the value min{V) (if defined) is 
in VA^{0(\og n)] (under both brave and cautious semantics), for general P^^. 

Proof. The proof is the same as for the membership part in the Theorem 15, except for the fact that 
EXISTENCE[k] is feasible in U2 for general logic programs. Then, the binary search can be implemented in 

ppSl- [0(log n] ^ p^P |-o(log n)] . □ 

Using the result demonstrated above, it is not difficult to see that given a rule-observation pair V, an 
outlier of minimum size can be computed in polynomial time using an NP (resp. E2) oracle for stratified 
(resp. general) logic programs. 

Theorem 17. Given a rule-observation pair V = {P'^^, P°^^), computing an arbitrary outlier O such that 
min{V) = \0\ (if defined) is 

1. in FZ\2'[0(log n)], for stratified P'^^, and 

2. in F/A|'[0(log n)] ( under both brave and cautious semantics), for general P'^^. 

Proof The problem can be solved by (i) computing the value min(V) — it has been seen that this value can 
be computed performing 0(log \ V\) calls to an NP (resp. oracle, and then (ii) guessing an outlier O 
having size \0\ = min{V), together with its witness W, and checking conditions (1) and (2) of Definition 
1. Point (ii) above is feasible with an extra NP (resp. oracle call for stratified (resp. general) logic 
programs. Hence, the following result follows. □ 

One may wonder whether the computation problem is, in fact, complete for the above complexity 
classes. Actually, for the case of stratified LPs, the simple membership result can be sharpened by assessing 
its precise complexity, which account for the possibility of having several outliers with size min(V). To this 
end, it is necessary to recall some further complexity notions. 

An NP metric Turing machine MT is a polynomial-time bounded nondeterministic Turing machine 
that on every computation branch halts and outputs a binary number. The result computed by MT is the 
maximum over all these numbers. The class OptP contains all integer functions that are computed by 
an NP metric Turing machine, whereas OptP[0(log n)] is that subclass thereof containing all functions 
/ whose value f{x) has O(logn) bits, where n = \x\. The class FNP//OptP[0(log «)] contains all (par- 
tial) multi-valued functions g for which a polynomially-bounded nondeterministic Turing machine T and 
a function h G OptP[0(logn)] exist such that, for every x, T computes the value g{x), provided that 
both X and h{x) are taken in input (see [19]). Notice that it is well-known that every multi-valued function 
g G FNP//OptP[0(log n)] has a refinement (single-valued) function / G FZ\f [0(log n)], i.e., for every x it 
holds that g{x) is defined iff f{x) is defined and f{x) G g{x). Moreover, a problem (with possibly multiple 
solutions for a given instance) is solvable in FNP//OptP[0(log n)] iff any of such solutions is computable in 
FZ\f [0(log n)]. 

In the light of this observation, the proof of Theorem 17 provides an algorithm for computing a refine- 
ment of COMPUTATION[min]; the complexity of COMPUTATION[min] on its own is, instead, more naturally 
defined in terms of the class FNP//OptP[0(log n)], where n = \V\ is the size of the rule-observation pair, as 
shown next. 
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Theorem 18. Let V = (p^is pobs^ ^ stratified rule-observation pair Then, COMPUTATION[min] is 
FNP//OptP[0(log n)]-complete. 



Proof. (Membership) The value min{V) has O(logra) many bits at most, where n is the size of the input, 
and it can be computed by an NP metric Turing machine. Then, let us guess two sets O and W and verify in 
polynomial time that (i) O is an outlier with witness W, and (ii) the size of O is min^V). 

(Hardness) A reduction can be shown to the X-MAXIMAL MODEL problem: Given a formula 
= ci A . . . A Cm in conjunctive normal form on the variables Y = {Yi, ...,Yh} and a subset X QY, 
compute a satisfying truth assignment M for ^ whose X-part is maximal, i.e., for every other satisfying 
assignment M' there is a variable in X which is true in M and false in M' . This problem was proved to 
be FNP//OptP[0(log «)]-complete in [19] under the following notion of metric reduction: A problem 77 
reduces to a problem 77', if there are polynomial-time computable functions f{x) and g{x, y), such that: (i) 
for any instance I of 77, /(7) is an instance of 77', and /(7) has solution iff 7 has a solution, and (ii) for any 
arbitrary solution S of /(7), g{I, S) is a solution of 7. 

A rule-observation pair V{(j)) = {P^^'^ {(/)), P°^'^{(f))) is built such that: (i) contains the two facts 

xl"', x°"* for each variable Yi in Y, the fact t/i for each variable Yi in X, and the fact yes; (ii) P'^'**(#) consists 
of the rules r2, . . . , ru of the encoding in the proof of Theorem 15, plus the following rule: 



r[ : unsat ^ z/(ij,2), i^fe.a)- 1 < j < m s.t. Cj = tj^i V tj^2 V ij,3 

where u is the following mapping: 

f x*?"* , if t = Yi 
^^(*) = |^L \i{t = ^Yi 

Let a be an assignment for (/). Let us denote by the set {z^^ , • • • , a;f " } U {yi \ £i = in AYi E X}, 
where each £i is in (resp. out) if and only if the variable Yi is not (resp. is) true in a. 

Let us comment the only difference with the encoding used in Theorem 15, i.e., the substitution of rule 
ri with rule r[ . Actually, rule ri was used to evaluate whether the set of facts of the form xf^ form a clique: 
if the clique is not correctly formed, then unsat is entailed by the program. The behavior of rule r[ is 
symmetric, because the rule checks whether the assignment at hand satisfies the formula: if the assignment 
is not satisfying, then unsat is entailed by the program. Provided this modification only, with the same 
arguments as in the proof of Theorem 15, it can be seen that (i) for each satisfying assignment a, the set O"^ 
is an outlier with witness W = {yes} in V{4>); and, {ii) each outlier O in V{(j)) is of the form O = O^, for 
some satisfying assignment a. 

Observe now that given an assignment a, the corresponding outlier is such that jO'^l = n + | 
£i = in AYi e X}\, and recall that 4 = in means that the variable is false in a. Hence, the reduction 
establishes a one-to-one correspondence not only between outliers and truth assignments for X, but also 
between minimum size outhers and the satisfying assignments for (p with a maximum number of X variables 
made true. Clearly, each of such assignments is also an X-MAXIMAL MODEL of ^. The result follows by 
observing that the construction can be done in polynomial time. □ 

It is worth pointing out that the reduction presented above is parsimonious [40]. In fact, by letting Y be 
the set of all the variables in the formula the theorem above estabUshes a one-to-one correspondence be- 
tween outliers and satisfying assignments. Then, as a side result, the cost of counting the number of outliers 
turns out to be the same as the cost of computing the number of satisfying assignments, the archetypical 
complete problem for the class #P [82]. 
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CoroUary 2. Let V = {P^^^, pobs^ ^ rule-observation pair such that P^^^ is a stratified. Then, counting 
the number of outliers in V is i^P-complete. 

As with minimal diagnosis applications, an interesting problem is that of singling out an outlier "core", 
that is, those facts that belong to all minimum-size outliers. This apparently difficult problem turns out to be 
not more difficult than computing the value of min{V), as shown in the following theorem. 

Theorem 19. Given a stratified rule-observation pair V = {P^^^, P'^^^'j and a fact f, deciding whether, for 
each outlier O having minimum size in V, it holds that f E: O is A2[0{log n)]-complete. 

Proof. (Membership) The problem can be solved in polynomial time with O(logn) many NP oracle calls. 
The value min{V) can be firstly computed by exploiting the algorithm presented in Theorem 15. Then, with 
an additional NP oracle it can be checked if there is an outlier O such that (i) \0\ = min{V), and (ii) f ^ O. 

(Hardness) Given a formula (f) in conjunctive normal form on the variables Y = {Yi, K„}, a subset 
X (ZY, and a variable Yi, deciding whether Yi is true in all the -MAXIMUM models is ^^^[(^(logra)]- 
complete, where a model is maximum if it has the largest X-part. The result trivially follows by exploiting 
the construction in Theorem 18 that, in fact, reUes on a one-to-one correspondence between X-MAXIMUM 
models and outliers of minimum size. □ 



5 Observing Rules 

The outlier detection framework depicted so far relies on agents' observations coming as facts that encode 
some aspects of the current status of the world. To this aim, it suffices that agents have some "sensing" 
capability for monitoring the external environment. However, it would be desirable to have agents that, 
besides sensing, would also be able to interact with the environment in a more elaborate way. 

For instance, in several multi-agent appUcations, agents may be involved in dialogues with other agents. 
Dialogues may start from the need to achieve an explicit goal, such as to persuade another party, or to find an 
information, or to verify an assumption (cf. [86]). Other form of dialogues may occur during a negotiation 
[56], i.e., when agents operate in an environment with limited resource availability and their goal is to obtain 
a resource (see, e.g., [87, 78]). In this contexts, agents might (maUciously) export knowledge for taking 
advantage of its competitor agents and, therefore, it is relevant to have agents equipped with the capability 
of having a set of facts and rules to encode their own "observations", i.e., the knowledge exchanged with 
other agents while communicating. 

Then, outlier detection techniques can be profitably used for singling out the pieces of knowledge (de- 
fined as sets of rules and facts and acquired by communicating with other agents or generally by learning 
from the environment) that look anomalous w.r.t. the trustable internal agent view of the world. 

Example 5. Consider again the agent that is in charge of monitoring the status of the network TV. 
Assume that has no a-priori knowledge about the connectivity, i.e., the rule component P^ is empty. 
Then, might have been informed by another agent of a property about M, which is encoded in the rule 
o : up(c) <— up(h). 

Armed with this rule, the agent monitors the status of the net and observes that h is up but, surprisingly, 
c is not. Clearly, there is something strange about these observations. However, in this case the agent might 
doubt the fact that h is actually up, but also about the rule o, which has been observed, but is not part of its 
trustable knowledge. < 
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In order to study this extended framework, it is necessary to introduce some changes in the basic def- 
inition. Let P'^^ be a logic program encoding general knowledge about the world, and let p^-°^^ be a 
set of facts and rules encoding some observed aspects of the current status of the world, called extended 
observation component. Then, the structure V = {P^^^, pe-obs^ extended rule-observation pair. 

It is worth noting that, in this novel context, the type of the logic program P{V) is determined by the 
rule program together with the extended observation set. 

It is easy to see that the complexity of the outlier detection problems studied so far remains unchanged 
for extended rule-observation pairs where P{V) is a stratified or a general logic program. Indeed, all the 
hardness proofs refer to P'^^^ including facts only (and hence hold in the extended framework) and all the 
membership results can easily accommodate rules in the observations (belonging to the same class of logic 
programs to which P'^^^ belongs to) without additional costs. 

However, it is interesting to study what happens in the case of positive rule components, for which 
Theorem 1 showed that no outher exists if observations are restricted to be facts. To analyze this scenario, 
it is first made clear that a rule r is entailed by a positive program P if r is satisfied by the unique minimal 
model M of P. 

Surprisingly, Theorem 1 does not hold for the case of extended rule-observation pairs. For instance, one 
can consider the extended rule-observation pair V = ^pri'^ pc-obs^ sNith P^^^ = and p^-obs _ j^, ^ 



b, b}. Then 6 is an outlier with witness a <— 6. Indeed Pi'P){b},{a^b} ^ ^ b), while P(P){a^bj |= 
-(a ^ b). 



Thus, in the following, the computational complexity of the EXISTENCE problem is stated when ex- 
tended rule-observation pairs are considered. This result helps in understanding the characteristics of de- 
tecting outliers over extended pairs — complexity figures for the other problems introduced in the paper 
(such as OUTLIER-CHECKING, WITNESS -CHECKING and OW-CHECKING) can be then obtained by simple 
adaptations of analogous proofs shown for the standard case, and are, therefore, omitted. 

Theorem 20. Let V = (P""'®, P^-°^^) be an extended rule-observation pair Then EXISTENCE is 

1. NP-complete, for positive and stratified P{V),^ and 

2. E2 -complete ( under both brave and cautious semantics), for general P{V). 



1. Membership can be proven with the same line of reasoning as the proof of Theorem 2. Therefore, 
let us focus on the hardness which shall be proved for positive P{V). Consider a Boolean formula in 
conjunctive normal form <P = ci A . . . A over the variables xi , . . . , x„, such that each clause contains 

at most three distinct (positive or negated) variables. 

An extended rule-observation pair P(^) = {P'^\<1>) , P''-°'°'' {$)) is defined such that: (i) P^-°bs(^) 
contains exactly the fact xj and xf for each variable Xi in plus the rule ok <— sat, (ii) P^^^{$) is 



Proof. 




sat < Cui. 

ok <— xJ , xf . VI < i < n 



* We thank one of the anonymous referees for having suggested this result and its proof. 
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where tt is the mapping: 

( xj , if t = Xi, 1 < i < n 
[xf,ift = -iXi, 1 <i <n 

Clearly, P(#) is positive and can be built in polynomial time. Now it is shown that # is satisfiable 4» 

there is an outlier in V{(P). 

(=^) Suppose that ^ is satisfiable, and take one of its satisfying truth assignments, say a, for the variables 
xi,. . . , Xn- Consider the sets = {ok ^ sat} U {xj \ Xi is false in a} U {x[ \ Xi is true in cr} 
andO'" = P^-^'^^XW .11 is now shown that is an outlier with witness . Indeed, the program 
P(P(#))w<T does not entail ok, because for each variable Xj either x/^ or xf is evaluated true. 
Moreover, it entails sat, because of the construction of and of the fact that the encoding evaluates 
the truth values of the assignment a, which is satisfying. Thus, P('P(^))w'^ |= ^{ok <— sat), and 
condition (1) in Definition 1 is satisfied. As for condition (2) in Definition 1, it is sufficient to observe 
that P{V{'P))y\)'' ^o'^ coincides with P{V{$))pc-ohs^^y, therefore, the program contains no fact and 
both sat and ok are evaluated false. Thus, P('P(^^))w'^,C"^ N (^^ ^ sat). 

(<;=) Assume that there is an outlier O with witness W in V{^). It is shown that <P is satisfiable. To this 
aim, notice that the rule [ok <— sat) must be part of W, because ok and sat are the only facts that 
can be entailed in the encoding. Then, because of condition (1) in Definition 1, it must be the case 
that P{V{<P))w \= -^{ok ^ sat), i.e., P(P(#))w must entail sat and not ok. By the fact that ok is 
not entailed, it can be concluded that for each variable Xi, at least one element in the set {xf, xf } 
is in W. Therefore, W assigns a truth value to some of the variables of # and may leave undefined 
some other variables (those variables whose corresponding facts have been both in W). Formally, 
a (partial) assignment a for the variables in <P can be defined such that if xf (resp. xf) is in W, 
then Xi is true (resp. false) in a. Then, by the fact that sat is entailed, it must be the case that a is 
satisfying. 

2. The proof is similar to that of Theorems 3 and 4. □ 
6 Implementing Outliers Detection Problems 

Now that the framework for outlier detection has been illustrated and its complexity has been investigated, 
attention can be focused on the problem of devising effective strategies for its implementation. Specifically, 
sound and complete algorithms are exhibited that transform any rule-observation pair V into a suitable logic 
program >C(P) such that its stable models are in a one-to-one correspondence with outliers in V. 

The most interesting aspect of this transformation is that, since stable models represent the solution 
of the outlier problems, it is possible to implement a prototype tool for finding outliers with the support 
of efficient stable models engines such as GnT [49], DLV [61] and Smodels [67]. In fact, reformulations 
in terms of logic programs under stable model semantics have been already exploited in the literature for 
prototypically implementing other reasoning tasks such as abduction (see, e.g., [51, 35]), planning (see, e.g., 
[81, 80]), and diagnosis (see, e.g., [23, 29]). 

6.1 Stratified Pairs 

In this section, the case of a pair V = {P^^, P°^^) is considered such that P"^'^ is a stratified logic program. 
The rewriting algorithm OutlierDetectionToStableModels is shown in Figure 5. It takes in input the pair V, 
and outputs a logic program C{V), which is built according to the following ideas. 
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Input: A stratified rule-observation pair V = (P^'*^, P°^^), where P°^^ 


— i rtV^Q t' 


Output: A logic program C{V)\ 






Method: Perform the following steps: 






1. C{V) := 0; 






2. /* Rule part rewriting */ 






for each rule r G p^^^ of the form a * — bi , - - ■ , bk, not ci , ■ 


• ■ , not 


Cn, insert into CCP) the rules 


(a) a!"- ^b5S---,bJi,not cf ,---,not c^K 




(b) a'^^ ^bf ,---,bf ,not cf .■■■,not cf. 






3. /* Outlier and witness guessing */ 






for each obsi G pobs insert into C^P) the rules 






(a) Oi * — not Oi . Oi — not Oi . 






(b) Wi ^ — not Wi . Wi — not wi . 






4. /* Observations definition */ 






for each obsi 6 poba^ insert into €(7^) the rules 






(a) obs^'^ < — not Oi, not v±. 






(b) obs?^ *— not Wi. 






5. /* OutUer and witness checking */ 






for each obsi e P°'•^ insert mto C{V) the rules 






(a) badCl «- wi, obsf . 






(b) satC2 <— Wi, obs?^. 






6. /* Constraints */ 






(a) for each obsi 6 pobs^ insert into C{V) the rule si ^ 


Oi, Wi, 


not si. 


(b) insert into C{V) the rule S2 «— not satC2, not S2 . 






(c) if P"^'^ is stratified then insert into C{V) the rule S3 <— 


badCl, 


not S3. 



Fig. 5. Algorithm OutlierDetectionToStableModels. 



First of all, a suitable rewriting of P^^ is inserted into jC(P). Let us denote by S[L] the rewriting of 
a set S of rules obtained by substituting each atom p occurring in S with the new atom p^. Then, the 
algorithm inserts in the Step 2 the programs P'^^'^[Cl] and P'^^''[C2]. Intuitively, P'^'**[Cl] is used for checking 
condition (1) in Definition 1, while P'^^'^[C2] is used for checking condition (2) in the same definition. 

Rules inserted in Step 3 serve the purpose to guess an outUer and its associated witness. Each fact obsi 
in P°^^ is associated with two new facts oi and wi, where, intuitively, oi (resp. wi) being true in a model 
means that obSi belongs to an outlier (resp. witness) in V. In other words, truth values of facts Oi and v± in 
any model for £-{V) univocally define an outUer and a witness set for it, respectively. 

Rules inserted in Step 4 serve the purpose of introducing in the program JC,{V) the rewriting [Cl] and 
P^yy[C2], in order to simulate the removal of the outlier and the witness from the set P°^^ which is needed 
for verifying whether both conditions in Definition 1 are satisfied. For each atom p in the set of observations 
pobs^ two rules are introduced. In particular, rule 4.(a) guarantees that obs^^ is true in the program if it is 
neither an outUer nor a witness. Similarly, 4.(b) guarantees that obs^^ is true if wi is not. 

Rules inserted in Step 5 evaluate conditions (1) and (2) in Definition 1. Indeed, the atom satC2 is 
evaluated true if a fact obs^ is true even if assumed to belong to a witness (wi true), i.e., if condition (2) in 
Definition 1 is satisfied in the model. Similarly badCl is true if obs^^ is true but obsi belongs to a witness 
(wi true), i.e., if condition (1) is not satisfied in the model. 

Summarizing, the subprogram of C{V) built in Steps 1-5 is such that it guesses the values for each Oi 
and Wi and verifies that both conditions in Definition 1 are satisfied. 

In order to finaUze the transformation, it is necessary to add some constraints (Step 6). The rules inserted 
in Step 6 have the form sj a, not sj . As sj does not appear in any other rule of the program JO,{V), then 
it must be false in any stable model of C{V). Thus, these rules act as constraints imposing that a must be 
false in all the models. Therefore, rules inserted in step 6. (a) impose that any obsi cannot be an outlier and 
witness at the same time, the rule added in step 6.(b) imposes that interest is only in stable models in which 
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satC2 is true, and, finally, the rule added in step 6.(c) imposes that interest is only in stable models in stable 
models in which badCl is false. 

The correctness of the algorithm is proved in the following theorem. 

Theorem 21. Let V = ^pi'is^ p°tis^ ^ stratified rule-observation pair, and let C{V) be the rewriting 
obtained by the algorithm in Figure 5. Then, 

1. for each outlier O with witness W in V, there is a stable model M of C{V) such that {obsi | oi G 
M} = Oand {obsi | wi € M} = W, and 

2. for each stable model M of C{V), there is an outlier O with witness W in V, such that {obsi | oi G 
M} = Oand {obsi | Wi G M} = W. 

Proof. 

1. Let O be an outUer with witness W in V. Let M\ denote the stable model of the program P{'P)\\;, 
and M2 denote the stable model of the program P{V)y\!^o- Consider the interpretations I^^ = {a*^^ | 

a G Ml}, ^ {aC2 I ^ g ^^1^ jguess ^ |q. | ^^3. g 0} j {q. | obSi O} U {Wi | obSi G 

W} U {wi I ObSi ^ W}, and 1°^' = {ohsf \ obSi W} U {obsf | obSi ^ (O U W)}, and let 
M = 7*^1 U 7*^2 J jguess y jobs u |satC2} be an interpretation of C{V). 

Now, it is shown that M is a stable model of C{V) such that {obsi | oi G M} = O and {obsi | wi G 
M} = W. To this aim, notice that by construction: 

- {obSi I Oi G M} = O, because Oi is in M if and only if obSi is in O; and 

- {obSi I Wi G M} = W, because Wi is in M if and only if obSi is in W. 

Therefore, it remains to show that M is a stable model of C{V), i.e., that it is the minimal model of the 
positive program C{V)^ . To this aim, let denote the program consisting of the rules added in the z-th 
step of the algorithm, and observe preliminary that C{V)^ is the program (P''^''[Cl] U {obSi^ | obSi ^ 
W})^U {P''^[C2\ U {obsf I ObSi ^ (O U W)})^U u Ip^^)M u (p6^M^ ^^^^^ applying 

the definition of reduct): 

A ■ {P'^[C1] U {obsf I ObSi ^ W})^ = (P(P)w[Cl])^''\ 

Indeed, by definition, P('P)w[Cl] is the program P''^''[Cl] U {obSi^ | obSi ^ W}. Moreover, 
all the predicates in such a program have the form p*^^, where p is a predicate symbol in P{V). 
Therefore, (P(P)w[Cl])*^ = (P(P)w[Cl])^'''^^°''^^'l°^'^^^l and the result follows because 
{obSi^ I obSi W} is a subset of 7*-^^. To see why the last containment holds, recall that 7*-^^ 
is a renaming of the model Mi which must contain all the observations that are not in W, by con- 
struction. 

B : (P'l«[C2] U {obsf I ObSi (W U O)})^ = (P(P)w,o[C2])^'''. 

Indeed, it can be appUed the same fine of reasoning of point A above. 
C : (p3)^ = {p. I p is an atom in 7^"^^*}. 

D : (P5)^ = {badCl ^ Wi,obs?^ | obsi G P""""} U {satC2 ^ Wi,obsf .}. 
E : Ip^)M = {s^ ^ Oi,Wi. I obSi G P"""*^} U {sg ^ badCl.}. 

Then, the result follows because of the following two properties: 
Property Pi: M is model of 

Proof. All the rules in C{V)^ (see points A — E above) are satisfied by M: 
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- Rules in A are satisfied by M. Indeed, program P('P)yv;[Cl] coincides with P{V)y^> modulo a 
renaming of the predicate symbols, and I^^ is in fact a renaming of the unique stable model 
Ml of P('P)v\;, by construction. Then, the result follows since Mi is, by definition, the minimal 
model of the positive program PiV)^ . 

- Rules in B are satisfied by M. Indeed, program P{V)y\),o[^'A coincides with P{'P)y\;,o modulo 
a renaming of the predicate symbols, and I^'^ is in fact a renaming of the unique stable model M2 
of P('P)w,o, by construction. Then, the result follows since M2 is, by definition, the minimal 
model of the positive program P{V)^(j. 

- Rules in C are satisfied by M since is a subset of M. 

- Rules in D are satisfied by M. Indeed, as for rules of the form {badCl <— Wi, obs^^. | obsi G 
pobsj^ notice that, by construction of lo^ess^ jg -^^ ^jjy o\>q^ g yy. Moreover, it 
is claimed that obs^^ is in M if and only if obsi ^ W, thereby having that the body of all such 
rules is always evaluated false by M. To see why the claim holds, observe that if obsi W 
then obSi^ is in M by construction. Let us now assume that there is a fact obsi in W and, for 
the sake of contradiction, that obs^^ is in M as well. It follows that obs^^ is in J'^^ and, hence, 
that obSi belongs to Mi by construction of the set I^^. But this is impossible because W is an 
outlier and, therefore, is such that all the facts in the witness set do not occur in the unique stable 
modelMiofP(7')w 

Moreover, as for the rules of the form {satC2 <— wi, obs^^.}, they are satisfied by M because 
satC2 is in M. 

- Rules in £■ are satisfied by M. Indeed, rules in the set {si <— oi, wi. | obsi G pobsi. satisfied 
by construction of the set which is in fact such that wi is in M if and only if obsi G W, 
and such that oi is in M if and only if obsi G O. Then, given that by definition W fl O = 0, the 
body of such rules is always evaluated false in M. To conclude, rule S3 <— badCl. is satisfied 
by M because badCl is not in M. 

Property Pr. There is no model M' of C{V)^ such that M' c M. 

Proof. Recall that program C{V)^ has the form shown in points A — E above and assume, for the 
sake of contradiction, that there exists a model M' c M for C{V)'^ . First, observe that F"^*'* C M' 
(see point C). Moreover, because of the fact that {obs^^ | obsi W} is a subset of i*^^ and that 
{obs^^ I obSi ^ (W U O)} is a subset of i*^^ (points A and B, respectively), it can be observed 
that M has in fact the form /<^^ U /'^^ U P"^** u {satC2}. Thus, the following scenarios can be 
distinguished: 

(a) Assume that 7*^^ n M' C 7*^^ Then, 7<^i n M' is a model for (P'''^[Cl] U {obs^^ 1 obsi ^ 
W}y''\ i.e., for (P(P)vi;[Cl])^'''. It follows that the set M{ = {a | a^^ G (7^^ n M')} is 
a model for P{V)^^ as well (notice that (P(P)h;[C1])-'^^' coincides with P{V)^^ modulo the 
renaming of predicate symbols). However, M'l is also a subset of Mi, and this is impossible 
because Mi is the stable model of P{V)y^ and, therefore, the minimal model of P{V)^ . 

(b) Assume that 7<^2 n M' C 7*^2 jhen, 7*^2 n M' is a model for (P"i'[C2] U {dbsf \ obsi ^ 
(OU W)})^""", i.e., for (P(P)w,o[C2])^'''. It follows that the set = {a | a^^ ^ (/f^2pJ^^/)| 
is also a model for P{V)^^(^ (notice that (P(7')w,o[C2])^^' coincides with P{V)^^(^ modulo 
the renaming of predicate symbols). However, M'2 is also a subset of M2, and this is impossible 
because M2 is the stable model of P{V)n>,o and, therefore, the minimal model of P{V)^q. 

(c) Assume that satC2 is not in M' . Then, after (a) and (b) above, it is the case that M' = 7*^^ U 
jC2 y jguess^ Then, sincc satC2 is not in M', because of the rules added in Step 5.(b), it holds 
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that for each Wi in M', obs^ is not in M'. Hence, by construction of jsuess^ follows that 
for each obsi in W, it is the case that obs^ is not in J*-^^. However, J*^^ is a renaming of the 
model M2 of P{'P)w,o- Therefore, the model M2 does not entail any fact in W. But this is a 
contradiction with condition (2) in Definition 1. 

2. Let M be a stable model of C{V). First of all, note that by rules inserted into C(V) in Step 6 of algorithm, 
M is such that {i) for each letter obSi in and cannot belong simultaneously to M, (ii) 

satC2 G M, and (iii) badCl M. Furthermore, by rules inserted into C{V) in Steps 3. (a) and 3.(b) it 
is the case that, for each letter obSi in P^^^, either Oi or oi and either Wi or Wi belong to M. 
Consider, now, the disjoint sets O = {obSi | Oi G M} and W = {obsi | wi G M}. It has to be shown 
that both conditions in Definition 1 are satisfied. To this aim consider the interpretation /s^esA _ | 
obSi G 0} U {oi I obSi 0} U {wi | obsi G W} U {wi | obSi W}, and notice that M can be 
written as l'^^ U l'^'^ U i^^'^** U {satC2}, where l'^^ and 7*^^ are the subsets of M containing all the 
predicates of the form p*^^ and p'^^, respectively. At this point, the reader may check that rules in C{V)^ 
have again the form illustrated in points A — E above. 

Given that M is a minimal model of C{V)^^ , it follows that /'^^ is a minimal model of (P''''[Cl] U 
{obs^^ I ObSi ^ W})^ = (P(P)w[Cl])^''\ and I^^ is a minimal model of {P'^^[C2] U {obs?^ | 
ObSi ^ (W U O)})^ = {P{V)yv,o[C2]y^\ Then, Mi = {a | a^^ G I^^} is aminimal model of P^^^ 
and M2 = {a I a"^^ G l'-'^} is a minimal model of P-^'^q- Therefore, Mi (resp., M2) is the stable model 
of Pyv (resp., Pw,o)- 

To conclude, the following properties can be shown: 

Proof. Assume, for the sake of contradiction, that there is a fact obSi in W such that P{V)w \= 
obSi. Then, obsi must belong to the unique stable model Mi, and obs^^ must belong to I^^. 
However, Wi belongs to M by construction, and therefore by rule 5. (a) badCl is in M, which is 
impossible. 

• P{V)w,o ^ -W. 

Proof. Assume, for the sake of contradiction, that for each fact obsi in W, Pi'P)w,o \= ~'obSi. 
Then, the unique stable model M2 does not contain any fact in W, and symmetrically I*-^^ does not 
contain any fact of the form obsj^, with obSi in W. Then, satC2 is not in M because of the rule 
5.(b), which is impossible. □ 



Minimum-size Outliers In order to translate detection problems aiming at singling out minimum-size 
outliers into a suitable logic program, an approach will be exploited which was used, for instance, in the 
DLV system and relying on extending classic logic programming by introducing weak constraints. 

Weak constraints, taking the form of rules such as :~ bi, • • • ,bk,not b^+i, • • • ,not bk+m, express a 
set of desired conditions that may be however violated; their informal semantics is to minimize the number 
of violated instances. In fact, in [17] it is proved that the introduction of weak constraints allows the solu- 
tion of optimization problems since each weak constraint can be regarded as an "objective function" of an 
optimization problem. 

Given a program P UW where P is a set of rules and W a set of weak constraints, an interpretation M 
is a stable model for P U ll/' if M is a stable model for P. The stable models of P UW are ordered w.r.t. 
the number of weak constraints that are not satisfied: best stable models are those which minimize such a 
number [17]. 
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Example 6. Given a graph G = {V, E), denoted by the unary predicate node and the binary predicate edge, 
we can model the MAX_ CLIQUE problem, asking for the clique of G having maximum size, by means of the 
following program: 

c(X) ^ not nc(X), node(X). 

iic(X) ^ not c(X), node(X). 

p c(X), c(Y), X 7^ Y, not edge(X, Y), not p. 

:~ nc(X). 

where the first two rules are used for creating all the possible partitions of nodes into c and nc, the third one 
is used for ensuring that nodes in c forms a clique, i.e., each pair of nodes in c is connected by an edge, while 
the weak constraint minimizes the number of vertices that are not in the cUque, or equivalently it maximizes 
the size of the clique. Then, the best stable models are in one-to-one correspondence with maximum-size 
cUques in G. < 

Thus, the algorithm in Figure 5 can be modified by inserting the constraint :~ oi. into C{V), for each 
obsi G pobs fhen, letting C"{V) be the transformed program resulting from applying the modified algo- 
rithm, we have that minimum-size outliers in P are in one-to-one correspondence with best stable models 

of c^{r). 

Theorem 22. LetV = (p'^is .pobs^ ^ stratified rule-observation pair Then, 

1. for each minimum-size outlier O with witness W in V, there is a best stable model M. of C^{V) such 
that {obSi I Oi G M} = O and {obSi | Wi G M} = W, and 

2. for each best stable model M. of C^{V), there is an outlier O with witness W in V, such that {obsi | 
Oi G M} = O and {obSi | Wi G M} = W. 

Proof. Given a rule-observation pair V, C{V) = C{V) U {:~ Oi. | obSi G P°^^}, where the stable 
models of the program C{V) are in one-to-one correspondence with outliers in V (Theorem 5). 

By definition, each model M in SM.{C{V)) is also a model of C"{V). Moreover, M is a best model 
if it minimizes the number of violated constraints, i.e., if there is no other model M' containing a fewer 
number of atoms of the form Oi. The result follows by noticing that this number is, in fact, the size of the 
outlier O constructed in the proof of Theorem 5. □ 



6.2 General Pairs 

Next let us consider the case of general rule-observation pairs. In this case, it should be pointed out the con- 
straint imposed by rule 6.(c) would not suffice for ensuring the satisfaction of condition (1) in Definition 1. 
Indeed, the outlier detection problems turned out to be complete for the second level of the polynomial hi- 
erarchy for general rule-observation pairs, while any polynomial time transformation into a logic program 
under stable model semantics may encode problems complete for the first level only. 

In order to deal with this problem, a more powerful logic formalism must be exploited, while keeping the 
size of the encoding polynomially-bounded in the size of the rule-observation pair. Specifically, the solution 
relies on a rewriting into a disjunctive logic program accounting for outliers under the cautious semantics. 
Actually, a similar rewriting for brave semantics can be obtained as well. 

Recall that disjunctive programs allow clauses to have both disjunction (denoted by V) in their heads 
and negation in their bodies. In more detail, a disjunctive rule r is a clause of the form: ai V • • • V am 
bi, ■ ■ ■ )bk,not ci, ■ ■ ■ , not Cn where n,k,m > Q, n + k + m > and ai, • ■ • , am,bi, • • • , b^, ci, ■ ■ ■ , Cn 



42 



are atoms. The disjunction V • • • V am, also denoted by h(r), is the head of r, while the conjunction 
bi, . . . , bk, not ci, • • • , not Cn, also denoted by b(r), is the body of r. 

Given an interpretation / for a disjunctive program P, the value of the disjunction = ai V • • • V a^ 
w.r.t. I is valuei{D) = max{{valuei{a.i) | 1 < i < n}). Similarly to the case of non-disjunctive programs, 
a ground rule r is satisfied by / if valuei{h{r)) > valuej{h{r)) and models of P are defined to be the 
interpretations that satisfy all the ground rules of P. Then, the model-theoretic semantics of disjunctive 
programs is defined as follows. 

For a positive disjunctive program P (i.e., rules in P do not contain negation in the body), the semantics 
is given in terms of the set its minimal models, denoted by M.M{Py . Moreover, for a general disjunctive 
program P, the stable model semantics [42] assigns to P the set SM.{P) of its stable models defined as 
suitable extension of stable models for disjunction-free programs. In particular, let P be a disjunctive logic 
program and let I be an interpretation for P. Then, P^ denotes the ground positive program derived from 
ground{P) by (1) removing all rules that contain a negative literal not a in the body and a G /, and 
(2) removing all negative literals from the remaining rules. An interpretation M is a stable model for P if 
and only if M G M.M.{P^). Under this semantics, disjunctive programs allow to solve problems that are 
complete for the complexity class (see, e.g., [24]). 

The algorithm OutlierDetectionToDisjunctiveStableModels is shown in Figure 6. To illustrate, step 2 
inserts two suitable rewriting of P'^^ into jOy{P). One of the two rewriting consists of a renaming of the 
original program P^^, call it P'^'®[C2] (see rules 2.(b)). Analogously to the rewriting shown for stratified 
pairs, P'''^[C2] serves the purpose of checking condition (2) of Definition 1. 

Instead, the task of checking condition (1) is demanded to rules inserted in Steps 2.(a), 4.(b), 7, 8, and 9 
— recall that, under the cautious form of entailing, this check consists in verifying that each stable model of 
P(P)w does not contain some atom in W. Intuitively this is carried out as follows. First, in order to encode 
the stable models of the program P{V)\^, for each atom p occurring in P{V), the atoms p'^^ and p*^^ are 
used: the atom p*^^ (resp. p*^^) being true in an interpretation of £^(P) means that the atom p is true (resp. 
false) in a stable model of P(P)w. Then, the atom satCl is used to check whether the truth values for 
the atoms of the form p^^ and p'^^ correctly encode a stable model for P{V)yv and satisfy condition (1) in 
Definition 1. Specifically, by rule 2.(a), satCl is entailed by the rewriting as soon as the atoms of the form 
p^^ and p*^^ do not satisfy some rule in P(P)w. Clearly, this is only a necessary condition for these atoms 
encoding a stable model for P(P)>v, and details on the encoding of some further conditions are provided 
below. 

Steps 3 and 4 of algorithm OutlierDetectionToDisjunctiveStableModels are similar to Steps 3 and 4 of 
algorithm OutlierDetectionToStableModels described above. Specifically, rule 4.(b) entails sat CI whenever 
an atom of the form obs^ comes true in the stable model of P{V)\^ while not being part of the witness 
set. Therefore, this rule guarantees that the truth values for the atoms of the form p^^ and p^^ consistently 
encode the observations that do not belong to the witness set. 

Rules inserted in subsequent Step 5 evaluate the conditions of Definition 1. Indeed, the atom satC2 is 
true if a fact obs^ is true even if assumed to belong to a witness (Wi true), i.e. if condition (2) in Definition 1 
is satisfied in the model, while the atom satCl is true if, for each fact obsi G pobs^ either (step 5. (a)) obsi 
is not assumed to belong to a witness (wi true) or (step 5.(b)) obsi is assumed to belong to a witness (wi 
true) and obs^ is true, i.e. if the witness set is not entailed in the model (cf. condition (1) in Definition 1). 

^ Differently from disjunction-free positive programs, positive disjunctive programs have more than one minimal model. Hence, 
for simpUcity and by a little abuse of notation, in the following MM{P) will denote a set of minimal models rather than a 
single minimal model. 
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Input: A rule-observation pair V = po^"*), where P°^^ = {obsi, obsn}; 

Output: A disjunctive logic program £^ {V) ; 
Method: Perform the following steps: 

1. C{V) := 0; 

2. /* Rule part rewriting */ 

for each rule r e P"^'^ of the form a <— bi , • • ■ , bit, not ci, • • • , not Cm, insert into C^(V) the rule 

(a) satCl ^ a'^^ b^^ ■ ■ ■ , bj;^ ■ ■ • , c^^ 

(b) a':^ ^bf ,---,bf ,notcf,---,notcS^ 

3. /* Outlier and witness guessing */ 

for each obsi G pobs^ insert into £^ (V) the rules 

(a) Oi <— not Oi . Oi ^ not Oi . 

(b) Wi ^ — not Wi . Wi < — not wi . 

4. /* Observations definition */ 

for each obsi G P"^", insert mto (V) the rules 

(a) obs?^ <— not oi , not wi . 

(b) satCl <— obs^ ,Wi. 

5. /* Outlier and witoess checking */ 

for each obsi G P°''^ insert into £^ (V) the rules 

(a) satCli ^ Wi . 

(b) satCli ^wi.obSj . 

(c) satC2 ^ Wi, obs^^. 

insert into £^(P) the rule satCl <— satCli, . . . , satCln. 

6. /* Constraints */ 

(a) insert into {V) the rule satCl ^ not satCl. 

(b) insert into C'"^ [V) the rule satC2 ^ not satC2. 

(c) for each obSi G P°''^ insert into (P) the rule s * — Oi, Wi, not s. 

7. /* Checking Condition (1): guessing an interpretation, mapping cp and rule assignment */ 

for each atom p G P'^''' U P°''^ insert into £^('P) the rules 

(a) p"Vp=^ 

(b) pcj)'- V ... V pif>^ «— p'^-'. (where s is the number of predicate symbols in Lit(P'^''') U P°t>s) 

(c) pr ' V . . . V pr* ^ p'^^ . (where r , . . . , are the rules in P'^^^ U pobs jjj Yj,hich p occurs in the head) 

8. /* Checking Condition (1): constraints */ 

for each atom p in P"^'= U P"^" insert into C^iV) the rules 

(a) satCl ^ p°Sp^^ 

(b) satCl <- p4>^ , p</>j , p'^^ . (for each i, j e {1, s}, with i ^ j) 

(c) satCl <— pr^,prj,p'^^ (for each i,j G {1, ...,^}, with i ^ j) 

(d) satCl <— pr^ , Wj , p'^^ . (if is a fact in pobs asserting the atom obs j = p) 

(e) satCl ^ pr^ , c'^-' , p'^-' . (for each atom c occurring negatively in the body of r^) 

(f) satCl ^ pr^ , b'^' , p'^^ . (for each atom b occurring positively in the body of r^) 

(g) satCl ^ Tpr^ , p(fi^ , qcf>*^ , Tp^'- . (for each atom q occurring positively in the body of r^, and for each h < k) 

9. /* Checking Condition (1): saturation */ 

for each atom p in P' '= U P°^^ insert into £^ (V) the rules 

(a) p*^^ ^ satCl. 

(b) p'^^ ^ satCl. 

(c) p</>^ ^ satCl. (for each i G {1, s}) 

(d) pr^ ^ satCl. (for each rule in which p occurs in the head) 
insert into £^ (V) the rule 

(e) satCli <— satCl. (for each atom of the form satCli) 



Fig. 6. Algorithm OutlierDetectionToDisjunctiveStableModeh. 
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Step 6 add rules which are similar to the constraints of the algorithm in Figure 5. In fact, these rules 
impose that the interest is in stable models in which both satCl and satC2 are true, and that obsi cannot 
belong to an outlier and a witness at the same time. 

The main differences w.r.t. the case of stratified rule-observation pairs are in steps 7, 8 and 9. Indeed, 
unlike stratified logic programs, general logic programs may have more than one stable model and, hence, 
under cautious semantics, rules inserted in Step 5 do not suffice to check condition (1) in Definition 1. 

Specifically, rules inserted in Step 9 are such that if a stable model M of contains satCl, then it 

must also contain all the atoms of the form p*^^ and p*^^ — actually, these rules infer also other atoms, namely 
satCli, and pr*, whose necessity will be clear in a while. Intuitively, since by rule 6.(a) satCl must 
belong to any model of the program jC'^{P), a necessary condition for M to be a minimal model of [P]^ 
is that, for each subset M' of M being a model of C^{P)^ , M' entails satCl in its turn. Due to the rules 
inserted in step 7. (a) and 8. (a), each model M' C M contains a guess of a model of P{V)y\). Hence, by 
looking at rule 2.(a) and rules in Step 5, it would be concluded that the minimahty of M guarantees that 
there is no model of P('P)w that does not satisfy condition (1). 

However, the check that the models of P('P)>v do not satisfy condition (1) must be restricted to its stable 
models only (ignoring models that are not stable and that do not satisfy condition (1)). This is precisely the 
purpose of the rules in Step 7 and 8. Specifically, the intended meaning of the rules 7.(b), 7.(c) and the rules 
inserted in Step 8 is to check for the minimahty of M' (w.r.t. {P)^) so that satCl is entailed whenever 
M' is not minimal. To this aim, a well-known characterization of minimal models for positive programs is 
exploited (indeed, the reduct of P{V)y\; w.r.t. an interpretation is a positive program), which is formahzed 
below for the reader's convenience. 

Lemma 1. (cf. [88], Theorem 2.7) Let P be a (non-disjunctive) positive propositional logic program, and 
let M be a model for it. Then, M is minimal if and only if there is a function 4> assigning a natural number 
to each atom occurring in P, and a function r assigning a rule of P to each element in M such that: 

1. h{r{p)) C M, 

2. h(r(p)) = p, and 

3. (j){q) < (f){p), for each q G h{r{p)). 

Accordingly, to assess the minimality of the model at hand, rules 7.(b) and 7.(c) guess for each atom p, 
an assignment to a natural number (p is assigned to i iff p0^ is true in the model), and a rule (exactly a rule 
rJ having p occurring in its head is assigned to p iff pr^ is true in the model), while rules added in Step 8 
checks whether the assignments are correct, i.e., whether they satisfy all the conditions in Lemma 1. 

The following theorem accounts for the correctness of the algorithm. 

Theorem 23. Let V = {V^^\ T"^"") be a rule-observation pair, and let (V) be the rewriting obtained by 
the algorithm in Figure 6. Then, 

L for each outlier O with witness W in V, there is a stable model M. of {V) such that {obsi | Oi G 

M} = Oand {obsi | wi G M} = W, and 
2. for each stable model M of C"^ iV), there is an outlier O with witness W in V, such that {obsi | oi G 

M} = Oand {obsi | wi G M} = W. 

Proof 

1. Let O be an outlier with witness W in V. Let M2 denote the stable model of the program P('P)w,o 
which entails an element in W (notice that such a model exists in order to satisfy condition (2) in 
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Definition 1). Consider the interpretations I^^ = {a'^^,a^^ | a is an atom in P'''*' U P"*^*^}, I^'^ = 
{a^2 I a e M2}, J5"^^^ = {Oi I obSi G O} U {oi | obSi ^ O} U {wi | obSi G W} U 
{wi I ObSi ^ W}, 1°^' = {obsf I ObSi ^ {O U W)}, = {p(/)^ | p is an atom in P^^^ U 
pobs^ 1 < i < s} (where s denotes the number of distinct predicates in P^^^ U P°^^), and P = {pr^ | 
rJ is a rule in P^'^" U P"'"'' such that h(rJ ) = p}, and let M = /"^^ U I^"^ U /^'"^^'^ U I"''^ U /"^ U P U 
{satC2, satCl, satCli, satCln} be an interpretation of £y{V). 

Now, it is shown that M is a stable model of {V) such that {obSi | Oi G M} = O and {obSi | Wi G 
M} = W. To this aim, notice that by construction: 

- {obSi I Oi G M} = O, because Oi is in M if and only if obSi is in O; and 

- {obSi I Wi G M} = W, because Wi is in M if and only if obSi is in W. 

Therefore, it remains to show that M is a stable model of C^{V), i.e., that it is the minimal model of 
the positive program {V)^ . To this aim, let P^^ denote the program composed by rules added in 
steps 2. (a) and 4.(b) of the algorithm, and let P* denote the program consisting of the rules added in the 
i-th step of the algorithm, and observe preliminary that {V)^^ is the program (P'-^^)^^U (P''^'*[C2] U 
{obsf I ObSi (O U >V)})^U {P"^)^^ U {P'')^^ U (P6)^ U (P^)^ U (P8)^ U (P9)^, where (by 
applying the definition of reduct): 

^ . |-pCl^M _ pCl 

Indeed, by definition P*^^ is a positive program, and therefore coincides with its reduct. 

B : {P'^'[Q2] U {obsf | obSi (WU O)})^ = (P(P)w,o[C2])^'''. 

Indeed, by definition, P{V)w,o['^'A is the program P''i^[C2] U {obsf | obSi ^ (W U O)]. More- 
over, all the predicates in this program have the form p^^, where p is a predicate symbol in P(P). 
Therefore, (P(T')w,o[C2])^^ = (P(P)H;,o[C2])^'''u{obsf |obsi0(wuo)}^ the result follows be- 
cause {obSi^ I obSi ^ (W U O)} is a subset of 7*^^. To see why the last containment hold, recall 
that i*-^^ is a renaming of the model M2, which must contain all the observations that are not in 
(W U O), by construction. 

C : (p3)M = {p. I p is an atom in la^ess^^ 

D : (P5)j^ = P5. 

E : (PY^ = {s ^ Oi,Wi. I ObSi e P°^}. 

F : (P7)A'/ = p7_ 

G : (P8)^ = P\ 
H : (P9)^ = P9. 

Then, the result follows because of the following two properties: 
Property Pi : M is a model of (P) ^ . 

Proof. All the rules in {V)^ (see points A — H above) are satisfied by M: 

- Rules in A are satisfied by M, as satCl is in M. 

- Rules in B are satisfied by M. Indeed, program P(P)w;^0[C2] coincides with P{V)y\!,o modulo 
a renaming of the predicate symbols, and J*-^^ is in fact a renaming of the unique stable model M2 
of P(P)vv;,c>, by construction. Then, the result follows since M2 is, by definition, the minimal 
model of the positive program P{V)^q. 

- Rules in C are satisfied by M since 19'^'^^^ is a subset of M. 

- Rules in D are satisfied by M since satC2, satCl, satCli, satCln are in M. 

- Rules in E are satisfied by M. Indeed, the set /f is such that Wi is in M if and only if 
obSi G yy, and such that Oi is in M if and only if obSi O. Then, given that by definition 
yy n O = 0, the body of such rules is always evaluated false in M. 
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- Rules in F are satisfied by M since the head of each of these rules belongs to /*-^^ U I*^ U by 
construction. 

- Rules in G are satisfied by M since satCl is in M. 

- Rules in H are satisfied by M since the head of each of these rules belongs to /*-^^ U I*^ U U 
{satCli, satCln}, by construction. 

Property P2: There is no model M' of C^{V)^ such that M' c M. 

Proof. Recall that program {V)^ has the form shown in points A — H above and assume, for the 
sake of contradiction, that there is a model M' C M for {V)^ . First, observe that F"^^^ C M' 
(see point C). Moreover, because of the fact that {obs^ | obsi (W U O)} is a subset of I^^ 
(point B, above), it can be observed that M has in fact the form M = I^^ \J I^"^ U U /"J^ U 

r U {satC2, satCl, satCli, satCln}. 

Assume now that I^"^ n M' C I^'^. Then, I^"^ n M' is a model for (P'''^[C2] U {obsf | obsi ^ 
(O U W)}y''\ i.e., for (P(P)w,o[C2])^'''. It follows that the set = {a | a^^ ^ (jC2 ^ ^/)| 
is also a model for P(V)^a (notice that (P(P)w,e)[C2])-^^'' coincides with modulo the 

renaming of predicate symbols). However, M2 is also a subset of M2, and this is impossible because 
M2 is the stable model of P{T')yv,o and, therefore, the minimal model of P{V)^q. 
Therefore, I*-^^ C M' . Consequently, M' C M implies by construction of the program C^{V) 
(rules in Step 9) that satCl is not in M' . Then, by carefully looking at rules in Step 8, the following 
conclusion is reached: 

(a) I^^ n M' does not contain a pair of atoms of the form p'^^ and p*^^ (rules 8.(a)) and each atom p 
occurring in P^^^ U P°^^ contains at least a fact in {p^^,p^^} (rules 7.(a)). In the following, let 
Ml denote the set {a | a*^^ € [I^^ n M')}. 

(b) For each atom p in Mi, exactly one atom of the form p^^ is in M' (rules 8.(b)). Thus, a mapping 
(f) can be defined from atoms in Mi to natural numbers such that <?!>(p) = i iff both p"^^ and p^^ 
are in M' . 

(c) For each atom p in Mi, exactly one atom of the form pr^ among those having p in their head 
is in M' (rules 8.(c)). Thus, an assignment r can be defined from atoms in Mi to the rules of 
pris yj pobs gypjj Uj^j ^i^p^ _ \)o\h p'^^ and pr^ are in M' and p occurs in the head of . 

(d) For each atom p in Mi, the assignment r defined in point (c) above is such that: 

i. r(p) is not a fact in P°^^ belonging to W (rules 8.(d)), i.e. p cannot be entailed by exploiting 
a fact removed by P°^^ since it is part of the witness W; 

ii. r(p) is a rule which does not contain an atom c in the body, where c is in Mi (rules 8.(e)), 
i.e. p cannot be entailed by exploiting a rule which is not in the reduct (p^is^Mi gjjjQg there 
is a negated atom c in its body which is true in Mi ; combined with point (i) above, these two 
constraints impose that r is an assignment of atoms to rules in P{V)^^ . 

iii. b(r(p)) C Mi (rules 8.(f)); 

iv. 4>{q) < (t>{p), for each q occurring positively in b(r(p)) (rules 8.(g)). 

Armed with the properties above, it can be concluded that M' defines a mapping (f) from Mi to rules 
in P{V)^ (with an associated mapping r for the atoms to the rules) such that all the conditions 
in Lemma 1 are satisfied. Furthermore, by rules 2. (a) and 4.(b), Mi is a model for the program 
P{V)y\!, otherwise satCl would be entailed. Thus, Mi is trivially a model for PiV)^^ and, by 
virtue of Lemma 1, it is in fact a stable model for P(7')w- However, such a stable model witnesses 
that condition (1) in Definition 1 is violated, because of the rules in Step 5 and the fact that sat CI 
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is false in M', which entails the existence of an observation in the witness set W which is in Mi. 
Contradiction. 

2. Let M be a stable model of C^{V). First of all, note that by rules inserted into {V) in Step 6 of the 
algorithm, M is such that (i) for each letter obsi in p°^^^, Oi and wi cannot belong simultaneously to 
M, (m) satC2 G M, and {in) satCl G M. Furthermore, by rules inserted into C'^{V) in Steps 3.(a) 
and 3.(b) it is the case that, for each letter obsi in P^^^, either oi or oi and either Wi or Wi belong to M. 
Consider, now, the disjoint sets O = {obsi | oi G M} and W = {obsi | wi G M}. It will be shown 
that both conditions in Definition 1 are satisfied. To this aim consider the interpretations I^^, ja^ess^ 
and r defined in point (1) of this proof, and notice that M can be written as I^^ U I^"^ U i^"^** U /"^ U 
F U {satC2, satCl, satCli, satCln}, where /*^^ is the subset of M containing all the predicates 
in M of the form p*^^ and p*^^. Indeed, since satCl is in M, the model must contain also the elements 
in l'^^ U 7*^ U U {satC2, satCli, satCln}. At this point the reader may check that the rules in 
C^iV)^ have again the form illustrated in points A — H above. 
To conclude, the following properties can be shown: 



Proof. Given that M is a minimal model of {V)^ , it can be concluded that I^"^ is a min- 
imal model of {P'^^[Q2] U {obsf | obSi (W U O)})^ = (P(7^)w,o[C2])^'''. Then, 
M2 = {a I aC2 g jC2| ^ minimal model of {P{V)w,o)^^- Now, it can be shown that 
P{V)w.o "'VV. Indeed, assume, for the sake of contradiction, that -P(7^)w,e' |=c "'VV. Then, 
the stable model M2 does not contain any fact in W, and symmetrically I*^^ does not contain any 
fact of the form obs^^, with obsi in W. Then, satC2 cannot be entailed by M, which is impossible. 



Proof. To this aim, observe that M is a minimal model of C^{V)^ and assume, for the sake of 
contradiction, that there is a fact obsi in W and a stable model Mi for P{V)yv such that obsi is in 
Ml. Then, given /^^ = {a.^^ \ a G U {a'^^_ | a Mi }, the set M' = l'^^ U /^^ y y y jr- 
is also a model for [V)^ , where /"^ and P encode the assignments 4> and r in Lemma 1. Since 
M' c M, there is a contradiction with M being a stable model of {V) □ 

This section concludes by observing that minimum-size outlier detection problems can be modelled 
for general rule-observation pairs by exploiting the same approach used for the stratified pairs, i.e., by 
introducing weak constraints in the program built by the algorithm in Figure 6. Specifically, given a pair 
V, the algorithm can be modified by inserting into {V) a constraint -.^ oi. for each obsi G P°^^. 
Then, letting C^'"" {V) be the transformed program, the same arguments as Theorem 22 with respect to the 
construction of Theorem 23 can be used for showing that minimum-size outliers in V are in one-to-one 
correspondence with best stable models of C'^''"{V). 

Theorem 24. Let V = (p^is ^obs^ ^ rule-observation pair Then, 

1. for each minimum-size outlier O with witness W in V, there is a best stable model M. of C^'^iJ^) such 
that {obSi I Oi G M} = O and {obsi | wi G M] = W, and 

2. for each best stable model M. of {V), there is an outlier O with witness W in V, such that {obsi | 
o^eM} = Oand {o^bs^ | wi G = W. 



48 



7 Related Work 



In this section, tlie most relevant related work is discussed. Research work related to that presented in this 
paper can be roughly divided into two groups: (i) work done on outlier detection from logic theories, which 
is very relevant to our own, and (ii) work done on outlier detection from data, which is, on the contrary, less 
related to concepts discussed in this paper. 

7.1 Outlier detection from data 

Outlier detection problems come in several different varieties within different settings, mainly investigated 
in the area of statistics, machine learning and knowledge discovery in databases. In almost all cases the 
presented approaches deal with data organized in a single relational table, often with all the attributes being 
numerical, while a metrics relating each pair of rows in the table is required. 

The approaches to outlier detection can be classified as supervised-leaimng based methods, where each 
example must be labelled as exceptional or not [60, 77], and the unsupervised-leaming based ones, where 
such labels are not required. The latter methods are obviously more general than the former ones. As the 
technique proposed in this work is unsupervised, the sequel of this section shall refer only to unsupervised 
methods. In their turn, unsupervised-leaming based methods for outlier detection can be categorized in 
several groups. 

The first group is that of statistical-based methods that assume that the given data set has a distribu- 
tion model. Outliers are those objects that satisfy a discordancy test, i.e., which are significantly larger (or 
smaller) in relation to the hypothesized distribution [11]. 

Deviation-based techniques identify outliers by inspecting the typical characteristics of objects and con- 
sider an object that deviates from these features an outlier [10]. 

A completely different approach that finds outliers by observing low dimensional projections of the 
search space is presented in [3]. Here, a point is considered an outUer if it deviates from the other data in 
some subspace. 

Further groups consists of density-based techniques [16], using a notion of local outlier that measures 
the plausibility for an object to be an outlier with respect to the density of the local neighborhood. 

Distance-based outlier detection was introduced by Knorr and Ng [55] to overcome the Umitations of 
statistical methods. A distance-based outlier is defined as follows: A point p in a data set is an outlier with 
respect to parameters k and S if at least k points in the data set lie at a distance greater than 5 from p. This 
definition generalizes the definition of outlier in statistics and is well suited when the data set does not fit 
any standard distribution. Ramaswamy et al. [73] modified the above definition of outlier, since that does 
not provide any ranking for outliers that are singled out. The new definition is based on the distance of the 
k-th nearest neighbor of a point p, denoted with D^{p), and it is as follows: Given k and n, a point p is an 
outlier if no more than ra — 1 other points q in the data set have a higher value for D'^ [q) than p. This means 
that the points q having the n greatest D^{q) values are considered outUers. In [8] a new definition of outlier 
that considers for each point the sum of the distances from its k nearest neighbors is proposed. The authors 
presented an algorithm using the Hilbert space-filling curve that exhibits scaling results close to linear. An 
analogous definition of outlier based on the fe-nearest neighbors has been used in [32] for unsupervised 
anomaly detection in intrusion detection appUcations. In [12] a near-linear time algorithm for the detection 
of distance-based outhers exploiting randomization is presented. 

The general differences and analogies between the approaches described above and our own should be 
clearly understood. In fact, those approaches deal with "knowledge", as encoded within one single relational 
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table that is, in a sense, flat, i.e. such that there is no explicit relationship linking the objects (tuples) of the 
data set under examination. Vice versa, the technique proposed in this work deals with complex knowl- 
edge bases, which may well comprise relational-like information, but generally also include semantically 
richer forms of knowledge, such as logical rules: in this latter case several complex relations relating ob- 
jects (atoms) of the underlying theory might be explicitly available. As a consequence, even if the intuitive 
and general sense of computing outliers in the two contexts is analogous, the conceptual and technical de- 
velopments are quite different in the two contexts, just as the formal properties of computed outUers are 
different. 

7.2 Outlier detection using logic 

Recently, outlier detection has emerged as an interesting knowledge representation and reasoning problem, 
in the context of default logic [7]. 

In particular, [7] originally introduced and investigated the concept of outher detection in the context 
of default logics. In that context, we have a propositional default theory A = {D, W), where D is a set 
of defaults and is a set of propositional formulas. An outlier is then defined as a literal in W which is 
not justified in A with respect to a witness S C W.ln that paper, the complexity of singling out outliers 
was studied for several fragments of Reiter's propositional default logics [75], that is, general propositional 
theories, disjunction-free theories, normal mixed unary theories, unary and dual unary theories, and acyclic 
unary and acyclic dual unary theories. The complexity results range from P to S3 . In their analysis, the 
authors investigated only the cautious form of reasoning and defined an outlier as a singleton set, thus the 
concept of minimality has no meaning in that setting. 

In this paper, the paradigm of [7] has been extended and generalized in several respects by (/) adapt- 
ing the notion of outlier to the paradigm of logic programming in such a way that outliers are no longer 
consttained to denote single individuals, (//) investigating the complexity of the corresponding detection 
problems, (Hi) considering also the brave form of reasoning, (/v) defining significant minimization-based 
outlier detection problems and studying their complexity, (v) extending the framework to have observations 
encoded as a logic theory, and, finally, (vi) providing rewriting techniques for effective outlier detection 
implementation. 

In conclusion, let us observe that the class of default theories studied in [7] most related to general 
logic programs is that of disjunction-free theories (DF). Indeed, it is well-known [42] that a general logic 
program P can be translated into an equivalent DF default theory A{P). However, the complexity results 
in [7] cannot be sttaightforwardly ttanslated to the case of theories expressed as logic programs, even in the 
restricted framework of cautious reasoning for detecting outliers as individual elements. 

Indeed, the hardness results in [7] for DF default theories rely on normal theories, whose relationships 
with logic programs has been not studied in the literature so far. And, in fact, there seems to be not much 
sense in translating a logic program into a normal default theory. Even a simple program like a <— not f3 
cannot be translated into normal defaults because some form of (3 is needed in the justifications. A form that 
would properly translate into normal is cc /3, not -la, and this requires classical negation. Nevertheless, 
if one has a program with no negation of any form, one can translate it to normal: a ^ /? can be translated 
to Since there is no negation, the extension one obtains will be equivalent to the stable model. 

Furthermore, since normal default theories satisfy the semi-monotonicity property, it is not clear how 
to design a polynomial-time translation from a logic program P under stable model semantics to a default 
theory A{P) ensuring abijection between stable models of P and extensions of A{P). 
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8 Conclusion 



In several knowledge-based applications, the scenario is significant where a rational agent equipped with 
his own trustable knowledge about the world has some information coming from the outside in the form of 
a set of observations denoting the status or facts about the external environment. Then, it is useful to let the 
agent be capable of discovering those observations (if any) whose truth clashes to some extent with some 
of the agent trustable beliefs. Such observations, called outliers, do indeed embody some abnormal status of 
things which is interesting, at least, to be singled out and checked. 

Differently from what is most often done in the literature, outliers have been defined in the paper on 
the basis of some logical properties, rather than being determined by statistical characteristics. In particular, 
by extending the work presented in [7], the concept of outlier has been formalized in the framework of 
logic-programming based knowledge systems under both cautious and brave stable model semantics. 

It has been shown that for a fact being an outlier depends on the given observation context: a fact may 
well be an outlier within a given observation set, while being normal in some other one. In words, it has 
been illustrated that outliers can be detected on the basis of observations to hand, by eventually singling out 
some properties standing out for their abnormality. In the proposed setting, it is in fact necessary to single 
out a supporting set, called the witness set, for the outlier to be singled out in turn. 

The aim of the paper has been that of formalizing several variants of outlier detection problems, of 
showing their complexity and providing algorithms for outlier detection via rewriting. 

The outlier detection framework introduced here resembles such important notions in AI such as diagno- 
sis and abductive reasoning and belief revision. Still, several important differences are there that confirm the 
usefulness of such novel formalization: belief revision, abduction and other reasoning task related to outlier 
detection have been comparatively discussed in the paper, where such differences have been highlighted. 

Complexity figures obtained for outlier detection problems are summarized in Figure 3 and 4 referring, 
respectively, to the basic and the min-cost variant of the problems. The complexity results show that outlier 
detection can be sometimes easy but it is intractable in most cases. 

An interesting scenario, which has been analyzed as well in the paper, occurs when one focuses on the 
complexity of the problem where only the observation component is assumed to vary (that is, to denote the 
problem input), whereas the rule component is assumed to be fixed. This is the complexity measure known 
as data complexity, and is relevant whenever the size of the evidence data is large as compared to the size of 
the knowledge base formalizing his expected properties. 

Moreover, the case where the observations are encoded as a logical theory, rather than "simple" sets of 
facts, has been considered. The concept of outlier has been properly generalized to arrange this idea, and the 
computational complexity of the problems arising in this extended setting has been also accounted for in the 
paper. 

Finally, sound and complete translations of outlier problems have been provided into equivalent infer- 
ence problems under stable model semantics. These translations can be taken advantage of in order for 
effectively implementing outUer detection on top of any available stable model solver (e.g., [49, 61, 67, 63]). 

Several research questions are left often by the paper. As far as the basic definition of outliers (and 
associated witness) is concerned, it might be investigated whether there are some alternative notions that 
appear convenient for modelling abnormality in the observations to hand in some specific kinds of applica- 
tion. As an example, one may study a notion where condition (1) in Definition 1 is provided according to 
the brave (resp., cautious) semantics, while condition (2) is provided according to the cautious (resp., brave) 
semantics, and investigate how this notion compare with the one proposed in the paper. 
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As far as the complexity studies are concerned, it is believed to be relevant to have the picture proposed 
in the paper completed by investigating the combined complexity of detection problems for first-order pro- 
grams, i.e., to study a setting where both the rule and the observation component are considered part of the 
input problem. 

Finally, from the appUcation side, it would be interesting to study whether logic-based outlier detection 
may have some fruitful appUcations in the database context, e.g., whether it can support the handhng of 
sophisticated kinds of constraint or whether it can be even practicable as a data mining technique. Indeed, 
even though outlier detection problems appear to be intractable in most cases (unless P = NP), it might be 
still the case that suitable algorithms, approximations and heuristics can be defined to cope with them in an 
efficient way. 

Acknowledgments 

Thanks to Rachel Ben-Eliyahu-Zohary for fruitful discussion on the subjects of this paper and for having 
originally set up the outUer detection scenario. 

References 

1. S. Lonardi A. Apostolico, M.E. Bock and X. Xu. Efficient detection of unusual words. Journal of Computational Biology, 
7(l/2):71-94, 2000. 

2. Serge Abiteboul, Richard Hull, and Victor Vianu. Foundations of Databases. Addison Wesley Publ. Co., Reading, Massachus- 
setts, 1995. 

3. C. C. Aggarwal and P.S. Yu. Outlier detection for high dimensional data. In Proc. of the 2001 ACM International Conference 
on Management of Data (SIGMOD), pages 37^6, Santa Barbara, California, USA, 2001. 

4. C.E. Alchourron, P. Gardenfors, and D.Makinson. On the logic of theory change: Partial meet contraction and revision func- 
tions. Journal of Symbolic Logic, 50(2):5 10-530, 1985. 

5. C.E. Alchourron and D. Makinson. The logic of theory change: Contraction functions and their associated revision functions. 
Theoria, 48:14-37, 1982. 

6. C.E. Alchourron and D. Makinson. On the logic of theory change: Safe contraction. Studia Logica, 44:405^22, 1985. 

7. F. AngiuUi, R. Ben-Eliyahu-Zohary, and L. Palopoli. Outlier detection using default logic. In Proc. of the 18th Interna- 
tional Joint Conference on Artificial Intelligence (IJCAI), pages 833-838, Acapulco, Mexico, 2003. Extended technical report 
available for download at http://dns2.icar.cnr.it/angiulli/papers/ijcai2003fuU.pdf. 

8. F. Angiulli and C. Pizzuti. Outlier mining in large high dimensional data sets. IEEE Transactions on Knowledge and Data 
Engineering, 17(2):203-215, 2005. 

9. Marcelo Arenas, Leopoldo E. Bertossi, and Jan Chomicki. Consistent query answers in inconsistent databases. In Proc. of the 
18th ACM SIGACT SIGMOD SIGART Symp. on Principles of Database Systems (PODS), pages 68-79, 1999. 

10. A. Arning, C. Aggarwal, and P. Raghavan. A linear method for deviation detection in large databases. In Proc. of 2th ACM 
International Conference on Knowledge Discovery and Data Mining (KDD), pages 164-169, Portland, Oregon, USA, 1996. 

11. V. Barnett and T. Lewis. Outliers in Statistical Data. John Wiley & Sons, New York, 1994. 

12. S.D. Bay and M. Schwabacher. Mining distance-based outliers in near linear time with randomization and a simple pruning 
rule. In Proc. of the 9th ACM International Conference on Knowledge Discovery and Data Mining (KDD), pages 29-38, 
Washington, DC, USA, 2003. 

13. C. Boutilier, N. Friedman, and Joseph Y. Halpem. Belief revision with unreliable observations. In Proc. of the 15th National 
Conference on Artificial Intelligence and 10th Innovative Applications of Artificial Intelligence Conference (AAAI/IAAI), pages 
ni-UA, Madison, Wisconsin, USA, 1998. 

14. Loreto Bravo and Leopoldo Bertossi. Logic programming for consistently querying data integration systems. In Proc. of the 
18th Int. Joint Conf. on Artificial Intelligence (IJCAl), pages 10-15, 2003. 

15. P. Brazdil, M. Gams, S. Sian, L.Torgo, and W. van de Velde. Learning in distributed systems and multi-agent environments. 
In Proc. of European Conference on Machine Learning - EWSL91, European Working Session on Learning, pages 412-423, 
Heidelberg, Germany, 1991. 

16. M. M. Breunig, H. Kriegel, R.T. Ng, and J. Sander. LOF: Identifying density-based local outliers. In Proc. of the 2000 ACM 
SIGMOD International Conference on Managment of Data (SIGMOD), pages 93-104, Dallas, Texas, USA, 2000. 



52 



17. F. Buccafurri, N. Leone, and P. RuUo. Enhancing disjunctive datalog by constraints. IEEE Transactions on Knowledge and 
Data Engineering, 12{5):845-860, 2000. 

18. K. Eshghi C. Preist and B. Bertolino. Consistency-based and abductive diagnoses as generalized stable models. Annals of 
Mathematics and Artificial Intelligence, ll(l-4):51-74, 1994. 

19. Z. Chen and S. Toda. The complexity of selecting maximal solutions. Information and Computation, 1 19(2):231-239, 1995. 

20. Jan Chomicki, Jerzy Marcinkowski, and Slawomir Staworko. Computing consistent query answers using conflict hypergraphs. 
In Proc. of the 13th ACM Conference on Information and Knowledge Management ( CIKM), pages 417^26. ACM Press, 2004. 

21. A. Ciampolini, E. Lamma, P. Mello, F. Toni, and P. Torroni. Cooperation and competition in alias: a logic framework for agents 
that negotiate. Annals of Mathematics and Artificial Intelligence, 37(l-2):65-91, 2003. 

22. Luca Console, Daniele Theseider Dupre, and Pietro Torasso. On the relationship between abduction and deduction. Journal 
of Logic and Computation, l(5):661-690, 1991. 

23. C. Damasio, L. M. Pereira, and M. Schroeder. Revise: Logic programming and diagnosis. In Proc. of the 4th International 
Conference on Logic Programming and Non-monotonic Reasoning (LPNMR), pages 354-363, Dagstuhl, Germany, 1997. 

24. E. Dantsin, T. Eiter, G. Gottlob, and A. Voronkov. Complexity and expressive power of logic programming. ACM Computing 
Surx'ey, 33(3):374^25, 2001. 

25. A. Darwiche and J. Pearl. On the logic of iterated belief revision. Artificial Intelligence, 89: 1-29, 1997. 

26. J. de Kleer and B.C. Williams. Diagnosing multiple faults. Artificial Intelligence, 32(1):97-130, 1984. 

27. Marc Denecker and Antonis C. Kakas. Abduction in logic programming. Computational Logic: Logic Programming and 
Beyond, pages 402-436, 2002. 

28. P. M. Dung. Negation as hypotheses: An abductive foundation for logic programming. In Proc. of of the 8th International 
Conference on Logic Programming (ICLP), pages 3-17, 1991. 

29. T. Eiter, W. Faber, N. Leone, and G.Pfeifer. The diagnosis frontend of the dlv system. AI Communications, 12(l-2):99-lll, 
1999. 

30. Thomas Eiter and Georg Gottlob. The complexity of logic-based abduction. Journal of the ACM, 42(1):3^2, 1995. 

31. U. Endriss, P. Mancarella, F. Sadri, G. Terreni, and F. Toni. The ciff proof procedure for abductive logic programming with 
constraints. In Proc. of the 9th European Conference on Logics in Artificial Intelligence (JELIA), pages 31^3, 2004. 

32. E. Eskin, A. Arnold, M. Prerau, L. Portnoy, and S. Stolfo. Applications of Data Mining in Computer Security, chapter A 
geometric framework for unsupervised anomaly detection. Kluwer, 2002. 

33. Tom F. and F. Provost. Adaptive fraud detection. Data Mining and Knowledge Discovery, 1(3):291-316, 1997. 

34. G. Mainetto F. Bonchi, F. Giannotti and D. Pedreschi. A classification-based methodology for planning audit strategies in 
fraud detection. In Proc. of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 
(KDD), pages 15-18, San Diego, California, USA, 1999. 

35. F.Lin and J.H. You. Abduction in logic programming: A new definition and an abductive procedure based on rewriting. 
Artificial Intelligence, 140(1-2): 175-205, 2002. 

36. N. Friedman and J. Y. Halpem. Belief revision: A critique. In Principles od Knowledge Representation and Reasoning, pages 
421^31, 1996. 

37. N. Friedman and J.Y. Halpern. Modeling beliefs in dynamic systems, part i: Foundations. Artificial Intelligence, 95:257-316, 
1997. 

38. Ariel Fuxman, EUiam Fazli, and Renee J. Miller. Efficient management of inconsistent databases. In Proc. of the 2005 ACM 
International Conference on Management of Data (SIGMODj, 2005. 

39. P. Gardenfors. Knowledge in Flux: Modeling the Dynamics of Epistemic States. MIT Press, 1988. 

40. M.R. Garey and D.S.Johnson. Computers and Intractability. A Guide to the Theory ofNP-completeness. Freeman and Comp., 
NY, USA, 1979. 

41. M. Gelfond and V. Lifschitz. The stable model semantics for logic programming. In Proc. of the 5th International Conference 
on Logic Programming (ICLP), pages 1070-1080, Seattle, 1988. 

42. M. Gelfond and V. Lifschitz. Classical negation in logic programs and disjunctive databases. New Generation Computing, 
9:365-385, 1991. 

43. M.R. Genesereth. The use of design descriptions in automated diagnosis. Artificial Intelligence, 24:411^36, 1984. 

44. A. Grove. Two modellings for theory change. Journal of Philosophical Logic, 17:157-170, 1988. 

45. W. van der Hoek H. Aldewereld and J. -J. Ch. Meyer. Rational teams: Logical aspects of multi-agent systems. Fundamenta 
Informaticae, 63(2-3): 159-183, 2004. 

46. S.O. Hansson. Semi-revision. Journal of Applied Non-Classical Logic, l:\5\-V15, 1997. 

47. S.O. Hansson. A survey of non-prioritized belief revision. Erkenntnis, 50(2-3):413^27, 1999. 

48. L. Me J. Kuri and G. Navarro. Fast multipattem search algorithms for intrusion detection. Fundamenta Informaticae, 56(1- 
2):23-49, 2003. 



53 



49. T. Janhunen, I. Niemela, P. Simons, and J.-H. You. Unfolding partiality and disjunctions in stable model semantics. In 
Proc. of the 7th International Conference on Principles of Knowledge Representation and Reasoning (KR), pages 411-419, 
Breckenridge, Colorado, USA, 2000. 

50. D. S. Johnson. Handbook of Theoretical Computer Science, Volume A: Algorithms and Complexity, chapter A catalog of 
complexity classes, pages 67-161. Elsevier and The MIT Press (co-publishers), 1990. 

51. A.C. Kakas and P.Mancarella. Database updates through abduction. In Proc of. the 16th International Conference on Very 
Large Data Bases (VLDB), pages 650-661, Brisbane, Queensland, Australia, 1990. 

52. Antonis C. Kakas and Paolo Mancarella. Generalized stable models: A semantics for abduction. In Proc. of the 9th European 
Conference on Artificial Intelligence fECA/), pages 385-391, 1990. 

53. Antonis C. Kakas, Bert Van Nuffelen, and Marc Denecker. A-system: Problem solving through abduction. In Proc. of the I7th 
International Joint Conference on Artificial Intelligence (IJCAI), pages 591-596, 2001. 

54. H. Katsvmo and O. Mendelzon. On the difference between updating a knowledge base and revising it. In Principles of 
Knowledge Representation and Reasoning, pages 387-394, 1991. 

55. E. Knorr, R. Ng, and V. Tucakov. Distance-based outlier: algorithms and applications. VLDB Journal, 8(3-4):237-253, 2000. 

56. S. Kraus. Negotiation and cooperation in multi-agent environments. Artificial Intelligence, 94(l-2):79-98, 1997. 

57. M.W. Krentel. The complexity of optimization problems. Journal of Computer and System Sciences, 36(3):490-509, 1988. 

58. D.T. Dupre L. Console and P. Torasso. A theory of diagnosis for incomplete causal models. In Proc. of the 10th International 
Joint Conference on Artificial Intelligence (IJCAI), pages 1311-1317, Detroit, Michigan, USA, 1989. 

59. D.T. Dupre L. Console and P. Torasso. On the relationship between abduction and deduction. Journal of Logic and Computa- 
tion, l(5):661-690, 1991. 

60. W. Lee, S.J. Stolfo, and K.W. Mok. Mining audit data to build intrusion detection models. In Proc. of the 4th ACM SIGKDD 
International Conference on Knowledge Discovery and Data Mining (KDD), pages 66-72, New York, USA, 1998. 

61. N. Leone, G. Pfeifer, W. Faber, T. Liter, G. Gottlob, S. Perri, and F. Scarcello. The dlv system for knowledge representation 
and reasoning. ACM Transactions on Computational Logic, To appear. 

62. Nicola Leone, Thomas Liter, Wolfgang Faber, Michael Fink, Georg Gottlob, Gianluigi Greco, Giovambattista lanni, Ldyta 
Kalka, Domenico Lembo, Mamizio Lenzerini, Vincenzino Lio, Bartosz Nowicki, Riccardo Rosati, Marco Ruzzi, Witold 
Staniszkis, and Giorgio Terracina. The INFOMK system for advanced integration of incomplete and inconsistent data. In 
Proc. of the 2005 International Conference on Management of Data (SIGMOD). ACM, 2005. Demo paper. 

63. F. Lin and Y. Zhao. ASSAT: Computing answer sets of a logic program by SAT solvers. In Proc. of the 8th National Conference 
on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence (AAAI/IAAI-02 ), pages 
112-118, Menio Pare, CA, USA, 2002. 

64. Fangzhen Lin and Jia-Huai You. Abduction in logic programming: A new definition and an abductive procedure based on 
rewriting. In Proc. of the 17th International Joint Conference on Artificial Intelligence (IJCAI), pages 655-666, 2001. 

65. Jinxin Lin and Alberto O. Mendelzon. Merging databases under constraints. International Journal of Cooperative Information 
Systems, 7(l):55-76, 1998. 

66. D. Makinson. Screened revision. Theoria, 63:14-23, 1997. 

67. I. Niemela and P. Simons. Smodels: An implementation of the stable model and well-founded semantics for normal LP. In 
Proc. of the 4th International Conference on Logic Programming and Nonmonotonic Reasoning (LPNMR), pages 420-429, 
Berlin, 1997. 

68. C. H. Papadimitriou. Computational Complexity. Addison-Wesley, Reading, Mass., 1994. 

69. C. S. Peirce. Abduction and induction. In Philosophical Writings ofPeirce, pages 150-156, 1955. 

70. Simona Perri, Francesco Scarcello, and Nicola Leone. Abductive logic programs with penalization: Semantics, complexity 
and implementation. Theory and Practice of Logic Programming, 5((l-2)): 123-159, 2005. 

71. D. Poole. Representing knowledge for logic-based diagnosis. In International Conference on Fifth Generation Computing 
Systems, pages 1282-1290, 1988. 

72. David Poole. Normality and faults in logic-based diagnosis. In Proc. of the 1 1th International Joint Conference on Artificial 
Intelligence (IJCAI), ^Sig&s 1304-1310, 1989. 

73. S. Ramaswamy, R. Rastogi, , and K. Shim. Efficient algorithms for mining outliers from large data sets. In Proc. of the 2000 
ACM SIGMOD International Conference on Management of Data, pages 427^38, Dallas, Texas, 2000. 

74. A. Rao and M. Georgeff. An abstract architecture for rational agents. In Proc. of 3rd International Conference on Principles 
of Knowledge Representation and Reasoning (KR), pages 439^49, Cambridge, Massachusetts, USA, 1992. 

75. R. Reiter. A logic for default reasoning. Artificial Intelligence, 13(1-2):81-132, 1980. 

76. R. Reiter. A theory of diagnosis from first principles. Artificial Intelligence, 32(l):57-96, 1987. 

77. S. Rosset, U. Murad, E. Neumann, Y. Idan, and G. Pinkas. Discovery of fraud rules for teleconmiunications-challenges and 
solutions. In Proc. of the 5th ACM International Conference on Knowledge Discovery and Data Mining (KDD), pages 409- 
413, San Diego, California, USA, 1999. 



54 



78. Fariba Sadri, Francesca Toni, and Paolo Torroni. Dialogues for negotiation: agent varieties and dialogue sequences. In John- 
Jules Meyer and MiUnd Tambe, editors, Proc. of the 8th International Workshop on Agent Theories, Architectures, and Lan- 
guages (ATAL), pages 69-84, 2001. 

79. A.L. Selman. A taxonomy of complexity classes of functions. Journal of Computer and System Sciences, 48(2):357-381, 
1994. 

80. T.C.Son, P.H.Tu, and C.Baral. Planning with sensing actions and incomplete information using logic programming. In Proc. 
of the 7th International conference on Logic Programming and Nonmonotonic Reasoning (LPNMR), pages 261-274, Fort 
Lauderdale, FL, USA, 2004. 

81. T.Eiter, W.Faber, N.Leone, G.Pfeifer, and A.Polleres. A logic programming approach to knowledge-state planning: Semantics 
and complexity. ACM Transaction on Computational Logic, 5(2):206-263, 2004. 

82. L.G. Valiant. The complexity of computing the permanent. Theoretical Computer Science, 8(2):189-201, 1979. 

83. W. van der Hoek and M. Wooldridge. On the logic of cooperation and propositional control. Artificial Intelligence, 164(1- 
2):81-119, 2005. 

84. M.Y. Vardi. The complexity of relational query languages. In Proc. of the 14th Annual ACM Symposium on Theory of 
Computing (STOC), pages 137-146, 1982. 

85. W.W. Vasconcelos, D.S. Robertson, C. Sierra, M. Esteva, J. Sabater, andM. Wooldridge. Rapid prototyping of large multi-agent 
systems through logic programming. Annals of Mathematics and Artificial Intelligence, 41(2-4): 135-169, 2004. 

86. D. N. Walton and E. C. W. Krabbe. Commitment in Dialogue: Basic Concepts of Interpersonal Reasoning. 1995. 

87. M. J. Wooldridge and S. Parsons. Languages for negotiation. In Proc. of the 14th European Conference on Artificial Intelligence 
(ECAl), pages 393^00, 2000. 

88. R. Ben-Eliyahu Zohary and R. Dechter. Propositional semantics for disjunctive logic programs. Annals of Mathematics and 
Artificial Intelligence, 12(l-2):53-87, 1994. 



55 



